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The  26S  proteasome,  composed  of  a  20S  proteolytic  core  and  19S  regulatory 
particle  (RP),  is  responsible  for  energy-dependent  degradation  of  ubiquitinated  proteins 
in  eukaryotic  cells.  The  20S  core  and  proteins  with  similarity  to  the  regulatory  particle 
triphosphatase  (Rpt)  subunits  of  the  RP  are  conserved  in  the  Archaea. 

This  study  reports  the  characterization  of  20S  proteasome  and  Rpt-like  proteins 
from  the  methanoarchaea  Methanococcus  jannaschii  and  Methanococcus  maripaludis. 
The  Rpt-like  protein  from  the  hyperthermohilic  M.  jannaschii  formed  a  550-kDa  complex 
which  stimulated  20S  proteasome-mediated  protein  hydrolysis  in  the  presence  of 
nucleotide  triphosphates  and  was  designated  MjPAN  (proteasome-activating 
nucleotidase).  When  compared  with  other  ATPases,  MjPAN  had  a  high  F^ax  for  ATP 
hydrolysis  [3,500  nmol  Pj  released  •  min"'  •  mg  protein"')  as  well  as  a  low  affinity  for 
ATP  iK„=497  ^M).  Deletion  of  residues  1  to  73  of  MjPAN  did  not  influence  complex 
formation  or  energy-dependent  proteolysis.  However  this  deletion  [MjPAN(Al-73)]  did 

xi 


increase  the  types  of  nucleotides  hydrolyzed  and  lowered  the  temperature  optimum  for 
ATP  hydrolysis.  Based  on  electron  microscopy,  both  MjPAN  and  MjPAN(Al-73) 
associated  with  the  ends  of  the  20S  proteasome,  resembling  proteasome  complexes 
isolated  from  eukaryotes.  These  results  suggest  that  archaeal  20S  proteasomes  require  a 
nucleotidase  complex  such  as  PAN  to  mediate  energy-dependent  proteolysis  of  folded 
proteins  in  the  cell. 

Methanococcus  maripaludis  is  a  mesophilic  methanoarchaeaon  for  which  genetic 
tools  and  high-yield  culture  techniques  have  been  developed.  Two  protein  complexes 
with  NEM-inhibitable  nucleotidase  activity  similar  to  MjPAN  were  identified  in  M 
maripaludis;  however,  neither  protein  cross-reacted  with  anti-MjPAN  antibody.  Thus,  a 
pan  gene,  which  encoded  a  protein  with  70%  identity  to  MjPAN,  was  isolated  from  M 
maripaludis.  The  MmPAN  protein,  which  was  purified  from  recombinant  Escherichia 
coli,  cross-reacted  with  an  anti-MjPAN  antibody  and  formed  a  600-kDa  complex.  The 
Mjpan  and  Mmpan  genes  were  also  synthesized  in  M  maripaludis.  This  increased  the 
level  of  PAN  protein  in  the  methanogen  as  detected  by  Western  blot  and  ATPase  assays 
of  cell  lysate.  Current  and  future  work  involves  using  the  constructs  developed  in  this 
study  to  further  characterize  PAN  proteins  and  to  identify  cellular  components  that 
interact  with  PAN  and  the  20S  proteasome. 


xii 


INTRODUCTION 

Proteolysis  is  now  understood  to  be  an  important  part  of  the  "life-cycle"  of  a 
protein  from  the  time  it  is  first  synthesized  on  the  ribosome,  to  the  time  the  protein  is 
degraded  and  its  amino  acids  are  recycled  to  be  re-used  in  biosynthetic  processes  by  the 
cell.  In  eukaryotes,  proteolysis  of  regulatory  proteins  such  as  cyclins  has  been  shown  to 
be  required  for  timely  progression  through  the  cell  cycle  (reviewed  in  167,  246)  and  is 
also  essential  in  degradation  of  foreign  proteins  for  antigen  presentation  during  the 
inmiune  response  (59).  Proteolysis  of  specific  transcription  factors  and  cell-division 
proteins  is  also  required  in  eubacterial  cells  for  cell  division  and  sporulation  (reviewed  in 
78, 177).  In  contrast  with  eukaryotic  cells,  prokaryotic  cells  lack  membrane-encased 
organelles,  such  as  lysosomes,  for  segregation  and  specific  degradation  of  cellular 
proteins.  Therefore,  the  necessity  for  proteases  which  separate  their  active  sites  away 
from  cytosolic  proteins  is  especially  important  in  these  organisms. 

Theoretically,  the  primary  amino  acid  sequences  of  proteins  contain  all  of  the 
information  necessary  for  proper  folding  (8).  However,  in  the  crowded  environment  of 
the  cell,  which  has  been  estimated  to  contain  at  least  30%  w/v  protein  (1 52),  both 
chaperones  and  proteases  are  needed  to  mediate  protein  folding  and,  when  necessary, 
protein  unfolding  and  degradation  (53).  On  the  surface,  protein  folding  and  assembly 
would  appear  to  be  completely  different  processes  than  protein  unfolding  and 
degradation.  However,  it  is  likely  that  some  of  the  same  proteins  involved  in  degradation 
of  protein  substrates  can  also  act  as  chaperones  to  aid  in  the  correct  folding  of  proteins 
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(reviewed  in  239,  300).  Obtaining  an  understanding  of  how  substrates  are  specifically 
partitioned  between  either  folding  or  proteolysis  and  determining  the  mechanisms 
involved  in  maintaining  the  proper  balance  of  protein  folding  and  degradation  in  the  cell 
have  become  the  focus  of  numerous  studies  (reviewed  in  80). 

Archaea  reside  in  a  wide  variety  of  often  extreme  environments;  therefore  their 
mechanisms  for  dealing  with  stress,  and  especially  their  proteolytic  and  chaperone 
systems,  are  of  particular  interest  (reviewed  in  154,  158).  In  this  study,  the  proteasome 
and  the  proteasome-activating  nucleotidase  (PAN)  were  studied  in  two  archaea, 
Methanococcus  jannaschii  and  Methanococcus  maripaludis.  Homologs  of  the 
proteasome  and  PAN  are  well  conserved  in  both  eukarya  and  archaea,  but  are  not  as  well 
conserved  in  the  eubacteria.  Archaea  are  similar  to  both  eubacteria  (in  their  metabolic 
processes  and  ultrastructure)  and  eukarya  (in  their  transcription  and  translation 
processes).  Therefore,  study  of  the  proteolytic  system  found  in  these  methanogens  and 
other  Archaea  should  add  to  an  overall  understanding  of  ATP-dependent  proteolysis  in  all 
three  phylogenetic  domains. 

The  overall  objective  of  this  project  was  to  obtain  a  better  understanding  of  the 
ATP-dependent  proteases,  especially  in  the  methanogenic  archaea,  with  special  emphasis 
on  the  hyperthermophile  Methanococcus  jannaschii  and  the  mesophile  Methanococcus 
maripaludis. 


LITERATURE  REVIEW 
This  review  of  the  literature  will  include  an  overview  of  proteolytic  and 
chaperone  systems  in  eukaryal,  archaeal,  and  eubacterial  cells,  with  a  particular  emphasis 
on  the  proteasome.  It  will  also  include  a  survey  of  the  proteolytic  and  chaperone 
machinery  present  in  the  archaea,  based  on  genomic  sequencing  as  well  as  direct 
experimentation.  It  will  conclude  with  a  review  of  genetic  studies  which  have  been 
performed  in  archaea,  with  a  special  emphasis  on  genetic  work  done  in  the  methanogens 
Methanococcus  voltae  and  Methanococcus  maripaludis. 

20S  Proteasomes 

The  proteasome  core,  with  a  sedimentation  coefficient  of  20S,  is  a  barrel-shaped, 
four-ringed  structure  which  is  highly  conserved  in  both  eucarya  and  archaea,  but  among 
eubacteria  has  been  found  only  in  the  gram-positive  actinomycetes  (243).  The  20S 
proteasome  was  first  identified  in  erythrocyte  cells  in  1968  (91)  and  since  then  has  been 
isolated  from  numerous  organisms  and  given  over  20  different  names,  including 
macropain  (161),  multicatalytic  proteinase  complex  or  MCP  (176),  and  low-molecular- 
weight  polypeptide  complex  or  LMP  (164).  The  name  proteasome  was  first  used  in  1988 
(10)  and  has  since  become  the  most  widely  used  name  for  this  complex.  The  proteolytic 
active  sites  of  the  proteasome  are  sequestered  within  an  internal  chamber,  which  prevents 
the  active  sites  from  coming  into  contact  with  cellular  proteins.  Substrate  proteins  can 
only  enter  the  protease  by  first  passing  through  a  very  small  opening  formed  by  the  outer 
rings  on  either  end  of  the  protease.  Therefore,  only  completely  unfolded  proteins  and 
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peptides  can  enter  the  proteasome,  where  they  are  then  degraded  in  an  ATP-independent 
reaction  (discussed  below). 

Archaeal  20S  Proteasomes.  The  20S  proteasome  from  Thermoplasma 
acidophilum  (41),  a  themoacidophilic  archaeon,  was  the  first  archaeal  20S  proteasome 
which  was  biochemically  characterized  or  produced  in  recombinant  E.  coli  (302).  Since 
then,  20S  proteasomes  have  been  purified  from  several  other  archaea  including 
Methanosarcina  thermophila  (157),  Pyrococcus  furiosus  (13),  and  Haloferax  volcanii 
(286).  In  addition,  each  archaeal  genome  which  has  been  sequenced  contains  ORFs 
predicted  to  encode  the  two  (a  and  P)  subunits  of  20S  proteasomes.  Most  archaeal 
proteasomes  are  presumed  to  contain  seven  a  or  P  subunits  per  ring.  The  a  subunits, 
which  form  the  outer  rings  of  the  four-ringed  complex,  limit  access  of  the  substrate  into 
the  inner  catalytic  chamber  formed  by  two  rings  of  seven  P  subunits  each  (Fig.  1). 

Crystal  structure  of  the  T.  acidophilum  20S  proteasome.  The  crystal  structure  of 
the  T.  acidophilum  20S  proteasome  has  been  solved  to  a  resolution  of  3. 4A  (151).  The 
overall  dimensions  of  this  proteasome  are  148  A  by  1 13A.  The  a  and  P  subunits  are  quite 
similar  in  their  tertiary  structure  and  crystallographic  fold,  each  consisting  of  two  five- 
stranded  antiparallel  p  sheets  with  flanking  a  helices,  even  though  they  have  low  primary 
sequence  identity  (<20%).  The  a  subunits  contain  an  additional  35  amino-acid  long  a 
helix  near  the  N-terminus,  and  the  N-terminal  12  residues  are  disordered  and  not  visible 
in  the  crystal  structure.  Seven  a  subunits  assemble  into  a  ring  with  an  opening  of  only 
13A  in  diameter.  This  opening  allows  only  completely  unfolded  substrates  to  enter  the 
proteasome  complex  and  come  in  contact  with  the  proteolytically  active  P  subunits. 


FIG.  1.  Predicted  structure  of  the  Methanosarcina  thermophila  20S  proteasome.  The 

image  is  an  atomic  model  based  on  the  crystal  structure  coordinates  of  20S  proteasomes 
from  Thermoplasma  acidophilum  and  S.  cerevisiae.  The  active  sites  involved  in  peptide 
bond  hydrolysis  are  indicated  in  black.  Computational  results  were  obtained  using 
software  programs  from  Molecular  Simulations  Inc.  (http://www.msi.com)  and  from  the 
Swiss  Institute  of  Bioinformatics  (http://www.expasy.ch). 
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The  interface  between  the  a?  and  ^^  rings  creates  an  opening  which  is  27  A  in  diameter, 
and  the  two  pv  rings  form  a  central  cavity  which  is  53A  in  diameter.  Based  on  the  crystal 
structure,  as  well  as  other  biochemical  inhibitor  and  mutagenesis  studies,  the  20S 
proteasome  was  determined  to  be  an  N-terminal  threonine  protease  (151,  156,  212).  Each 
P  subunit  has  one  active  site  as  demonstrated  by  binding  of  the  inhibitor  N-acetyl-Leu- 
Leu-norleucinal  (Ac-LLnL-al)  to  the  active  site  N-terminal  threonine  residues  in  the 
crystal  structure,  and  the  distance  between  each  active  site  is  approximately  28A.  The  N- 
terminal  threonine  serves  as  a  proton  acceptor  and  nucleophile,  with  glutamate'^  and 
lysine^^  forming  a  salt  bridge  to  complete  the  active  site.  Aspartate'^^  is  also  likely  to  be 
involved,  as  site-directed  mutagenesis  of  this  residue  inactivates  the  proteasome  without 
perturbances  in  the  overall  structure  (211). 

The  Ntn-hydrolase  family.  The  Ntn  (N-terminal  nucleophile)-hydrolase  family  of 
proteases  uses  the  side  chain  of  the  N-terminal  amino  acid  as  a  catalytic  nucleophile 
during  proteolysis  (23).  All  Ntn-hydrolases  are  synthesized  as  a  precursor  in  which  the  N- 
terminal  residue,  which  can  be  a  cysteine,  serine,  or  threonine,  is  exposed  by 
autocatalytic  processing  and  removal  of  a  propeptide.  Even  though  these  proteins  often 
have  no  detectable  primary  sequence  identity,  all  Ntn-hydrolases  which  have  been 
crystallized  have  a  similar  p-sandwich  fold  consisting  of  two  five-stranded  antiparallel 
P  sheets  with  a  helices  on  either  side  (19,  49,  84,  151).  This  fold  is  essential  for 
orientation  of  the  active  site  residues  including  the  N-terminal  nucleophile  as  well  as 
acidic  and  basic  residues  (Glu'^  and  Lys"  in  the  T.  acidophilum  proteasome)  which 
behave  as  a  salt  bridge  to  complete  the  catalytic  triad  (19,  49,  84,  151).  Most  20S 
proteasomes  and  the  eubacterial  homolog  HslUV  (discussed  below)  are  Ntn-hydrolases 
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with  N-terminal  threonine  residues.  Changing  the  N-terminal  threonine  of  the  T. 
acidophilum  20S  proteasome  to  serine  has  little  effect  on  the  cleavage  of  the  amide  bond 
of  small  fluorogenic  substrates  (156,  212).  However,  the  mutant  proteasome  was  less 
active  during  the  cleavage  of  full-length  proteins  (127)  and  was  not  as  stimulated  as  the 
native  20S  proteasome  by  PAN  (proteasome-activating  nucleotidase,  discussed  below)  in 
the  degradation  of  FITC-labeled  protein  substrates  (127). 

Eukaryotic  20S  proteasomes.  Eukaryotic  20S  proteasome  cores  are  more  complex 
than  archaeal  proteasomes,  with  seven  different  subunits  forming  each  ring  for  a  total  of 
14  different  subunits  per  20S  complex.  However,  each  individual  subunit  can  be 
classified  as  either  an  a-type  or  a  p-type  subunit,  based  on  sequence  similarity  to  the 
subunits  of  the  archaeal  proteasome.  Both  the  archaeal  and  eukaryotic  proteasomes 
contain  seven  a-type  subunits  form  the  outer  rings,  and  two  rings  of  seven  P-type 
subunits  create  the  inner  catalytic  chamber.  Interestingly,  eukaryotic  20S  proteasomes 
have  only  three  N-terminal  threonine-containing  active  P-type  subunits  (pi,  P2,  and  P5) 
per  ring,  for  a  total  of  six  active  sites  in  the  full  20S  complex.  In  antigen  presenting  cells 
of  higher  eukaryotes,  these  P  subunits  can  be  replaced  by  three  other  active  subunits 
(LMP2/pi;,  LMP7/P2/,  and  MECLl/pS,)  in  the  presence  of  interferon-y  (IFN-y)  to  form 
the  immunoproteasome  (1,  56,  58,  168).  Notably,  LMP2  and  LMP7  are  encoded  by  the 
major  histocompatibility  complex  (MHC)  class  II  locus  (229),  and  cells  deficient  in  these 
two  subunits  demonstrate  defects  in  antigen  presentation  (59).  Differences  observed 
when  these  subunits  are  incorporated  into  the  20S  proteasome  include  a  change  in  the 
peptide  cleavage  specificity,  with  a  decrease  in  cleavage  after  acidic  residues  (trypsin-like 
activity)  and  an  increase  in  cleavage  after  basic  and  hydrophobic  residues  (46).  It  has 
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been  determined  that  the  majority  (>73%)  of  peptides  bound  in  the  MHC  complex  have 
hydrophobic  or  aromatic  residues  at  their  C-termini  (169). 

Crystal  structure  of  the  20S  proteasome  of  yeast.  The  crystal  strucuture  of  the  20S 
proteasome  from  S.  cerevisiae  has  been  solved  to  a  resolution  of  2.4A  (84).  Unlike  the  T. 
acidophilum  20S  proteasome  in  which  the  N-terminal  residues  of  the  a  subunits  were 
disordered,  forming  a  13A  opening,  the  N-terminal  residues  of  the  yeast  proteasome 
could  be  resolved  and  appeared  to  completely  close  the  ends  of  the  cylinder.  This  was  not 
surprising,  since  proteasomes  purified  from  eukaryotes  are  usually  inactive  and  must  be 
treated  with  either  heat  or  detergents  such  as  SDS  for  hydrolysis  of  peptide  substrates 
(175, 222).  In  vivo,  eukaryotic  20S  proteasomes  are  presumed  to  be  stimulated  to  fiill 
activity  by  the  binding  of  regulatory  components  [such  as  the  19S  cap/regulatory  particle 
(RP)]  to  the  ends  of  the  cylinder  (97).  In  addition,  each  of  the  14  different  a  and  p 
subunits  were  found  to  be  in  a  defined  place  in  the  yeast  20S  proteasome  crystal 
structure,  as  had  been  previously  predicted  based  on  studies  using  immxmoelectron 
microscopy  (134,  135).  Notably,  only  the  three  active  p  subunits  contained  the  inhibitor 
Ac-LLnL-al  bound  to  the  N-terminal  threonine.  In  addition,  the  other  residues  essential 
for  activity  in  the  archaeal  proteasome  (Glu'^,  Lys^^  and  Asp'^^),  are  also  conserved  in 
these  three  active  subunits  of  the  yeast  20S  proteasome  (84). 

Eubacterial  proteasomes.  Proteasomes  have  also  been  purified  from  gram-positive 
actinomycetes,  one  of  which  (from  Rhodococcus  sp.)  is  of  intermediate  complexity, 
having  two  of  each  type  of  subunit  (243).  While  the  20S  proteasome  has  not  been  found 
in  gram-negative  eubacteria,  the  double-ringed  HslV  complex  has  been  termed  the 
"prokaryotic  proteasome",  based  on  its  N-terminal  threonine  active  site  and  the  similarity 


of  the  crytallographic  fold  of  its  subunits  to  the  (3  subunits  of  the  20S  proteasome 
(discussed  below). 

2QS  proteasome  assembly.  The  model  for  both  eukaryotic  and  archaeal 
proteasomes  is  that  the  a  subunits  of  the  complex  form  a  ring  which  provides  a 
scaffolding  upon  which  the  (3  subunits  are  assembled  (30,  31,  96).  Two  half-proteasomes 
formed  of  one  a  ring  and  one  P  ring  then  assemble  into  a  full  20S  proteasome.  Archaeal 
a  subunits,  when  synthesized  alone,  form  both  single  and  double  seven-membered  rings 
(156, 287,  301).  The  N-terminal  propeptides  of  the  P  subunits  are  removed 
autocatalytically  to  expose  an  active-site  threonine  during  assembly  of  the  20S  complex. 
If  P  subunits  are  synthesized  alone,  the  propeptide  is  not  removed  and  the  subunits 
generally  remain  as  inactive  monomers,  dimers,  and  trimers  (156,  287,  301).  While  the 
P  subunit  propeptide  may  serve  an  essential  role  in  the  archaeal  cytosol,  the  propeptide  is 
not  required  for  assembly  of  archaeal  20S  proteasomes  in  recombinant  E.  coli.  Synthesis 
of  the  P-subunit  without  the  propeptide  and  subsequent  mixing  of  equimolar 
concentrations  of  the  a  and  p  subunits  results  in  the  assembly  of  fully  functional  20S 
proteasome  complexes  (156, 287,  301). 

Some  of  the  proteasome  a-type  subunits  from  eukaryotes  can  also  form  single 
and  double  rings  when  synthesized  alone  (66,  67).  The  three  active  (N-terminal 
threonine-containing)  P-type  subunits  are  also  synthesized  as  inactive  precursors,  and  the 
prosequence  is  necessary  for  assembly  of  the  20S  proteasome  in  the  eukaryal  cell  (30).  In 
addition  to  mediating  proper  20S  proteasome  assembly,  the  propeptides  also  prevent  N"- 
acetylation  and  inactivation  of  the  N-terminal  catalytic  threonine  in  yeast  (9).  When  the 
P-type  subunit  Pre2/Doa3  (P5)  was  expressed  without  the  prosequence  in  yeast,  this 
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subimit  was  not  incorporated  into  the  proteasome  (31).  However,  the  prosequence  could 
operate  in  trans,  indicating  a  possible  chaperone-like  function  (31).  In  addition,  Umpl,  a 
small  protein  of  16.8  kDa,  has  also  been  identified  as  a  chaperone  which  interacts 
specifically  with  pre-processed  p  subunits  in  the  half-proteasome  and  is  necessary  for 
proper  assembly  of  the  20S  proteasome  in  yeast  (194).  The  Umpl  protein  is  internalized 
and  rapidly  degraded  by  the  proteasome  during  assembly  (194).  Transcription  of  umpl  is 
also  increased  in  mammalian  cells  in  the  presence  of  IFNy,  indicating  a  possible 
additional  role  for  Umpl  in  the  formation  of  the  immunoproteasome  (28). 

The  proteasome  fi-om  Rhodococcus  sp.  has  a  different  assembly  pathway  than 
either  eukaryal  or  archaeal  proteasomes  (160,  298).  The  a  subunits,  when  expressed 
alone,  do  not  form  a  ring.  Instead,  the  a  and  pre-processed  p  subunits  form  a  dimer, 
which  then  assembles  into  a  half-proteasome  made  up  of  a  double  ring  of  a  and 
p  subunits  (160, 298).  The  half-proteasomes  then  assemble  into  a  full  20S  complex 
during  which  the  propeptides  of  the  P  subunits  are  removed  (160, 298). 

26S  Proteasomes 

The  2-MDa  26S  proteasome  complex,  which  is  composed  of  the  20S  proteolytic 
core  as  well  as  an  RP  with  a  sedimentation  coefficient  of  19S,  has  been  found  thus  far 
only  in  eukaryotic  cells  (reviewed  in  40,  166, 246,  247).  It  was  first  identified  in  rabbit 
reticulocytes  based  on  its  ability  to  degrade  substrates  which  had  been  labeled  with 
ubiquitin  (101,  102),  and  subsequently  it  has  been  shown  that  the  majority  of  cytoplasmic 
proteins  are  degraded  by  the  26S  proteasome  in  eukaryotes  (94,  269).  In  addition,  the  26S 
proteasome  appears  to  be  the  primary  proteolytic  machinery  responsible  for  degrading 
foreign  proteins  into  peptides  for  presentation  in  the  MHC  Class  1  complex  (198). 


Numerous  proteins  have  been  identified  as  substrates  of  the  26S  complex,  and  many  of 
them  are  very  short-lived  regulatory  proteins,  such  as  cyclins,  transcription  factors,  and 
cylin-dependent  kinase  (cdk)  inhibitors  (reviewed  in  167, 246).  Notably,  the  26S 
proteasome  is  the  only  two-component  ATP-dependent  protease  identified  in  eukaryotic 
cells,  in  distinct  contrast  to  eubacterial  cells,  which  contain  several  (discussed  below). 

The  regulatory  particle  (RP)  of  the  eukaryotic  26S  proteasome.  The  regulatory 
particle  or  RP,  which  has  also  been  called  the  19S  cap  (188),  PA700  complex  (42),  or  the 
yL  particle  (129),  is  composed  of  ~15  subunits  ranging  in  size  from  25  to  110  kDa.  The  RP 
associates  with  20  S  proteasomes  in  an  ATP-dependent  manner  and  is  responsible  for  the 
recognition  and  translocation  of  substrates  into  the  proteolytic  20S  core  of  the  proteasome 
(reviewed  in  40,  166,  246, 247).  The  RP  can  be  further  divided  into  Base  and  Lid  sub- 
domains  (74).  The  Base  sub-domain,  which  directly  contacts  the  ends  of  the  20S 
proteasome,  contains  eight  subunits,  six  of  which  have  been  demonstrated  to  be  members 
of  the  AAA  (ATPases  associated  with  a  variety  of  cellular  activities)  family  of  proteins 
(42,  43).  These  AAA  proteins  of  the  Base  sub-domain  are  predicted  to  form  six- 
membered  rings  which  interact  directly  with  the  seven-membered  a-rings  of  the  20S 
proteasome  in  an  asymmetrical  mismatch.  As  discussed  previously,  eukaryotes  contain 
seven  different  a  subunits  in  the  outer  rings  of  the  20S  proteasome,  which  are  all  in  a 
well-defined  place  within  the  ring  (134),  and  the  six  AAA  components  of  the  Base  sub- 
domain  of  the  RP  have  also  been  shown  to  interact  specifically  with  one  another  (196)  in 
a  defined  place  within  the  complex  (74).  It  is  thus  likely  that  ATPases  of  the  RP  interact 
with  specific  a  subunits  of  the  20S  core,  albeit  in  an  assymetrical  mismatch.  Proposed 
fimctions  of  the  ATPase  components  include  coupling  of  ATP  hydrolysis  to  unfolding 


and  feeding  of  substrate  molecules  into  the  20S  core  of  the  proteasome  (72,  73).  In 
addition,  there  is  evidence  that  the  ATPases  of  the  Base  may  open  the  normally  closed 
gate  formed  by  the  a  subunits  in  the  yeast  proteasome  (83).  The  Lid  sub-domain  of  the 
RP  also  contains  several  subunits  that  have  been  characterized.  Several  of  the  Lid 
subunits  contain  coiled-coil  or  KEKE  motifs,  which  have  been  implicated  in  protein- 
protein  interactions  (196,  275).  In  general,  however,  less  is  known  about  the  function  of 
the  subunits  of  the  Lid,  as  there  is  little  sequence  conservation  among  them. 

Nomenclature  of  proteins  in  the  RP.  Several  different  systems  have  been  used  to 
name  the  subunits  in  the  RP,  including  numbering  the  15  subunits  originally  identified  in 
the  S.  cerevisiae  RP  as  SI  to  SI 5  (47).  However,  as  more  subunits  were  identified  from 
several  different  organisms,  the  names  became  more  ambiguous  and  confiising.  In 
addition,  it  is  now  becoming  clear  that  the  RP  is  not  a  well-defined  complex,  but  rather 
that  the  subunit  composition  of  the  RP  may  change  depending  on  the  specific  needs  and 
conditions  of  the  cell  (25).  Many  of  the  regulatory  proteins  were  originally  identified 
fi-om  unrelated  studies  of  yeast  mutants,  and  it  was  not  known  that  these  proteins  were 
also  members  of  the  RP.  Thus,  the  original  names  for  these  subunits  often  contained  no 
indication  that  they  were  members  of  the  RP  of  the  26S  proteasome.  Therefore,  a  "unified 
nomenclature"  was  developed  for  the  proteins  in  the  RP  (55).  The  six  ATPase  members 
of  the  Base  sub-domain  of  the  RP  were  renamed  Rptlp  to  Rpt6p,  for  regulatory  particle 
triphosphatase  proteins  (55).  For  example,  Rpt2p  is  the  designation  for  the  subunit 
previously  identified  as  S4  in  humans,  p56  in  bovine  cells,  and  Mts2,  Yhs4p,  or  Yta5  in 
yeast;  and  Rpt6p  is  the  designation  for  the  subunit  previously  identified  as  S  8  in  humans, 
p45  in  bovine  cells,  and  Sugl,  Cim5,  or  Crl3  in  yeast.  Mutants  defective  in  each  of  the 
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six  Rpt  proteins  can  show  widely  variant  phenotypes,  which  helped  to  contribute  to  the 
diverse  nomenclature  in  the  literature  even  within  the  same  yeast  species.  In  the  "unified 
nomenclature,"  the  other  twelve  subunits  of  the  RP  were  designated  Rpn  subunits,  for 
regulatory  particle  non-ATPase  proteins  (55).  The  two  largest  Rpn  subunits,  Rpnlp  and 
Rpn2p,  are  part  of  the  Base  domain,  along  with  the  six  Rpt  subunits  (72).  The  RpnlO 
subunit,  which  was  first  identified  as  a  ubiquitin-binding  subunit  (44, 48),  connects  the 
eight-subunit  Base  with  the  Lid  domain  composed  of  the  other  twelve  Rpn  subunits  (72). 
This  was  shown  when  a  modified  proteasome  containing  only  the  20S  core  and  the  Base 
domain  was  purified  (72)  from  a  yeast  RpnlO  mutant.  This  proteasome  could  both  unfold 
and  degrade  proteins;  however,  ubiquitinated  proteins  were  not  degraded  (72).  The  two 
non-ATPases  of  the  Base,  Rpnl  and  Rpn2,  are  predicted  to  form  a  hydrophobic  surface 
which  binds  partially  unfolded  substrates  in  a  non-specific  manner  (269).  It  has  been 
proposed  that  the  subunits  in  the  Base  sub-domain  of  the  RP  of  the  26S  proteasome  may 
be  able  to  discriminate  between  folded  and  unfolded  states  of  the  protein.  In  addition, 
depending  on  interacting  factors  {e.g.,  the  20S  proteasome,  other  subunits  in  the  RP),  the 
Base  can  target  the  substrate  either  for  refolding  or  for  unfolding  and  degradation  (24, 
72). 

AAA-type  ATPases 

The  AAA  (ATPases  associated  with  a  variety  of  cellular  activities)-type  ATPases 
are  a  subgroup  of  the  Walker-type  NTPase  family  (reviewed  in  37,  170,  186).  Walker- 
type  NTPases  all  have  a  well-conserved  region  which  contains  both  the  Walker  A 
(GPPGXGKT)  and  Walker  B  (DEID  box)  motifs  (271).  The  Walker  A  motif  binds  the  p 
and  y  phosphates  of  nucleotide  triphosphates  (208),  and  the  Walker  B  motif  is  involved  in 


14 

Mg^'^-coordination  and  ATP  hydrolysis  (27).  Members  of  the  AAA  family  have  an 
additional  highly  conserved  second  region  of  homology  (SRH)  found  C-terminal  to  the 
Walker  A  and  B  motifs  (240).  The  SRH  region  has  been  proposed  to  have  a  catalytic  role 
in  ATP  hydrolysis  based  on  studies  of  FtsH  proteins  with  point  mutations  in  the  SRH. 
These  SRH  mutants  were  able  to  bind  but  not  hydrolyze  ATP  (111).  The  AAA  proteins 
are  all  Mg^^-dependent  ATPases  which  are  inhibited  by  n-ethylmaleimide  (NEM),  a 
sulfhydryl  alkylating  agent  which  is  presumed  to  bind  to  a  well-conserved  cysteine 
residue  in  the  C-terminal  region  of  the  SRH  domain  (294).  All  members  of  this  family 
have  very  well  conserved  C-terminal  domains  but  very  little  if  any  N-terminal  identity. 
These  AAA  perform  a  wide  variety  of  roles  in  the  cell,  ranging  from  organelle  biogenesis 
to  proteolysis.  A  number  of  these  proteins  are  involved  in  protein  folding  and  unfolding 
as  well  as  in  the  disassembly  of  protein  complexes.  The  AAA  proteins  can  be  divided 
into  those  with  one  AAA  module  (Type  I),  and  those  with  two  (Type  II).  The  Rpt 
members  of  the  RP,  ClpX,  ClpQ  (HslU),  and  FtsH  (HflB)  proteins  are  all  Type  I  AAA- 
ATPases,  while  ClpA,  ClpB,  ClpC  (219),  Hspl04,  NSF  (241),  Cdc48  (57),  and  Lon  are 
Type  II  ATPases.  Notably,  for  several  of  the  Type  II  AAA  proteins,  one  or  both  of  the 
AAA  domains  are  necessary  for  oligomerization  of  the  ring  complex,  and  ATP  binding 
(but  not  hydrolysis)  is  often  necessary  for  complex  formation  (89,  225,  297). 

AAA-ATPases  in  the  RP  of  the  26S  proteasome.  As  mentioned  above,  the  six 
ATPase  members  (Rptlp  to  Rpt6p)  of  the  19S  cap  of  the  26S  proteasome  are  all  Type  I 
AAA  proteins.  Interestingly,  the  AAA  subunits  of  the  RP  are  much  more  similar  between 
species  than  any  of  the  Rpn  (non-ATPase)  subunits  of  the  RP.  In  addition,  the  Rpt 
subunits  have  higher  identity  to  homologs  from  other  species  than  to  paralogs  within  the 
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same  organism,  indicating  that  these  subunits  most  likely  serve  non-redundant  specific 
functions  within  the  cell.  For  example,  Rpt2p/S4  in  human  and  yeast  is  74%  identical, 
while  Rpt2p  and  RptSp  within  S.  cerevisiae  are  only  49%  identical  at  the  amino  acid 
level.  Likewise,  the  M.  jannaschii  Rpt2p  homolog  (PAN)  is  49%  identical  to  human 
Rpt2p  at  the  amino  acid  level,  while  the  identity  of  Rpt2p  and  RptSp  paralogs  within  M. 
jannaschii  is  only  38%.  In  yeast,  substitutions  of  the  conserved  ATP-binding  residue 
lysine  in  the  Walker  A  motif  were  lethal  in  four  out  of  the  six  proteasomal  ATPase  (Rpt) 
mutants  (74),  which  is  a  further  indication  that  these  six  proteins  serve  non-redundant 
functions  in  the  cell  (203).  Based  on  the  low  sequence  identity  of  the  N-terminal  regions 
of  the  AAA  subunits  of  the  RP,  it  has  been  suggested  that  this  region  is  responsible  for 
substrate  selection  and/or  binding  to  specific  Rpn  subunits  in  the  Lid  sub-domain  of  the 
RP. 

Other  AAA-ATPases.  One  well-characterized  AAA  protein  involved  in 
eukaryotic  organelle  biogenesis  is  Cdc48  (57),  which  mediates  ATP-dependent  fusion  of 
ER  membranes.  This  protein  forms  homohexamers  in  the  cytosol  and  was  first  identified 
by  complementation  of  mutants  which  arrested  with  an  undivided  nucleus  during  mitosis 
(57).  One  other  homolog  of  Cdc48  is  p97A^CP  (valosin-containing  protein),  which  also 
forms  a  homohexameric  ring  and  is  involved  in  membrane  fusions  in  the  Golgi  apparatus 
(189).  Other  well-characterized  Type  II  AAA  proteins  in  eukaryotes  include  NSF  (NEM- 
sensitive  fusion  protein)  and  its  yeast  homolog  Secl8p,  which  are  proteins  involved  in 
mediating  vesicle  fusion  events  and  transport  between  the  ER  and  the  Golgi  apparatus  in 
the  secretory  pathway.  The  NSF  is  required  for  protein  transport  from  the  ER  to  the  Golgi 
complex,  for  fusion  between  endosomal  vesicles,  and  for  transport  of  newly  synthesized 
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proteins  to  the  plasma  membrane  (165).  The  NSF  forms  cytosolic  homohexamers  as  well 
as  larger  membrane-associated  complexes  with  the  aid  of  soluble  NSF  attachment 
proteins  (SNAPs)  and  SNAP  receptors,  or  SNAREs  (165).  The  NSF  dissasembles  the 
SNAP/SNARE  complexes  in  an  ATP-dependent  reaction,  which  allows  another  cycle  of 
membrane  fusion  to  occur  (89).  The  N-terminal  domain  of  NSF  has  been  shown  to  be  the 
substrate-binding  domain  and  is  not  required  for  hexamerization  of  the  complex  (241, 
281).  The  first  AAA  module  of  NSF,  Dl,  has  high  ATPase  activity  which  is  required  for 
a  conformational  change  to  occur  which  mediates  disassembly  of  SNARE  complexes 
(89).  The  second  module  of  NSF,  D2,  has  very  high  affinity  for  ATP  but  very  low 
ATPase  activity  (89).  The  binding  of  ATP  or  a  non-hydrolyzable  analog  to  the  D2 
module  is  required  for  proper  oligomerization  of  the  hexameric  complex  in  the  cytosol, 
and  when  D2  is  expressed  alone,  a  hexameric  complex  still  forms  (241,  281).  The  crystal 
structure  of  the  D2  hexamerization  domain  has  been  solved  to  1 .75  A  resolution  in 
complex  with  both  ATP  and  the  nucleotide  analog  AMP-PNP  (143,  297).  The  NSF-D2 
was  found  to  form  a  regular  hexamer  of  97A  in  diameter  with  the  nucleotide  (ATP  or 
AMP-PNP)  bound  between  neighboring  subunits  of  the  ring  (143,  297). 

A  member  of  the  AAA  family  has  also  been  isolated  from  the  gram-positive 
actinomycete  Rhodococcus  erythropolis  (289).  When  this  protein  was  synthesized  in  E. 
coli,  the  subimits  formed  a  six-membered  ring-shaped  complex  with  ATPase  activity, 
which  the  authors  termed  ARC  (AAA-ATPase  Ring-shaped  Complex).  The  arc  gene  was 
found  1500  bp  upstream  of  an  operon  containing  the  20S  proteasome  genes.  However, 
interaction  of  ARC  with  the  20S  proteasome  was  not  reported. 
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Chaperone-like  activity  of  AAA  proteins.  Ironically,  a  study  of  the  mechanisms  of 
proteolysis  in  the  cell  must  also  include  a  study  of  the  mechanisms  of  protein  folding.  On 
the  surface,  protein  folding  and  assembly  would  appear  to  be  completely  different 
processes  from  protein  degradation.  However,  it  has  become  evident  that  some  of  the 
same  proteins  involved  in  degradation  of  protein  substrates  can  also  act  as  chaperones  to 
aid  in  the  folding  of  proteins,  at  least  in  vitro  (reviewed  in  239).  In  addition,  studying  the 
chaperone  and  unfoldase  activities  of  ATPase  components  of  ATP-dependent  proteases 
independent  of  the  proteolytic  components  can  aid  in  understanding  the  mechanism  by 
which  protein  substrates  are  unfolded  and  translocated  into  the  proteasome  and  other 
proteases.  The  AAA  proteins  which  are  components  of  ATP-dependent  proteases  have 
been  demonstrated  to  have  the  ability  to  serve  as  chaperones  independent  of  the 
proteolytic  component  (reviewed  in  80,  239,  300).  Both  ClpA  and  ClpX  have  been 
observed  to  prevent  aggregation  of  substrate  proteins  in  vitro  (145, 178,  278,  284).  The 
ClpB  protein  (290)  as  well  as  its  yeast  homologue  Hspl04  (184)  can  also  serve  as 
molecular  chaperones  (34,  185),  but  neither  one  of  these  AAA  proteins  has  been  reported 
to  associate  with  a  proteolytic  component  (e.g.,  ClpP,  HslV).  There  is  some  evidence  that 
FtsH  may  also  function  as  a  chaperone  (210).  The  N-terminal  AAA  domain  of  the 
membrane-bound  Ymel  protease  (FtsH  homolog)  from  yeast  can  operate  as  a  chaperone 
to  bind  unfolded  protein  substrates  and  prevent  their  aggregation  even  when  the  C- 
terminal  protease  domain  is  either  inactivated  or  removed  (144).  The  Base  component  of 
the  26S  proteasome  which  contains  the  six  Rpt  subunits  also  has  ATP-dependent 
chaperone  activity  independent  of  both  ubiquitin  and  the  20S  proteasome  (24).  In  one 
study,  non-ubiquitinated  citrate  synthase  could  be  efficiently  refolded  by  the  Base 


component,  and  when  the  20S  proteasome  was  included  in  the  assay,  citrate  synthase  was 
still  refolded  but  not  very  efficiently  degraded  (24).  Taken  together,  these  results  lead  to 
the  proposal  that  AAA-ATPases  can  behave  as  relatively  non-specific  "unfoldases"  to 
remodel,  disassemble,  and  unfold  proteins;  however,  the  ultimate  fate  of  a  substrate 
protein  rests  with  the  protein  {e.g.,  the  20S  proteasome)  which  interacts  with  the  AAA 
protein.  In  addition,  the  final  destiny  of  a  substrate  protein  for  either  degradation  or 
refolding  can  depend  on  the  balance  between  affinity  of  the  substrate  for  the  AAA- 
ATPase  as  well  as  the  stability  of  the  interaction  between  the  AAA  component  and  the 
proteolytic  component  (reviewed  in  80,  239,  300). 

The  RP  of  the  proteasome  has  also  been  shown  to  have  a  chaperone-like  role  in 
the  yeast  nucleotide  excision  repair  (NER)  pathway  for  UV -damaged  DNA  (205).  The 
26S  complex  binds  to  Rad23,  a  protein  which  is  required  for  NER  and  which  also 
contains  a  ubiquitin-like  domain.  Removal  of  the  ubiquitin-like  domain  from  Rad23  or 
inhibition  of  RP  complex  AAA-ATPases  resulted  in  UV-sensitivity.  However,  inhibition 
of  the  proteolytic  activity  of  the  20S  proteasome  did  not  result  in  sensitivity,  indicating 
that  the  RP  complex  has  a  chaperone-like  role  independent  of  the  proteolytic  activity  of 
the  20S  proteasome  (205).  It  has  been  proposed  that  the  19S  complex  may  bind  to  the 
Rad23/excised  DNA  complex  and  disassemble  the  complex  to  allow  for  recycling  of 
Rad23  (and  other  proteins  involved  in  NER).  This  pathway  bears  some  similarly  to  the 
NSF  protein  disassembly  pathway,  which  disassembles  SNARE/SNAP  complexes  to 
enable  fiirther  rounds  of  membrane  fusion  and  protein  trafficking  in  the  ER  to  take  place. 
Notably,  this  study  provided  an  in  vivo  example  of  the  Base  domain  of  the  RP  behaving 
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as  a  chaperone  and  provided  further  evidence  that  the  chaperone  activity  which  had  been 
previously  observed  in  vitro  may  also  have  physiological  relevance. 

ATP-dependent  Proteases  in  E.  coli 

Energy-dependent  proteolysis  is  also  important  in  the  maintenance  of  the 
eubacterial  cell.  Several  two-component  ATP-dependent  proteolytic  systems  have  been 
identified  in  E.  coli,  including  ClpAP,  ClpXP,  and  ClpYQ/HslUV  (reviewed  in  78, 193). 
In  analogy  to  the  eukaryotic  26S  proteasome,  each  of  these  proteases  contains  a 
proteolytic  component  (ClpP  or  ClpQ/HslV)  and  an  ATPase  regulatory  component 
(ClpA,  ClpX,  or  ClpY)  (209,  277). 

The  ClpAP  and  ClpXP  proteases.  The  ClpP  protein  is  a  serine  protease  which 
forms  a  double-ringed  complex  with  seven-fold  symmetry  (224).  The  ClpP  protease  can 
associate  with  either  of  the  hexameric  ATPase  components  ClpA  or  ClpX  (82).  While  the 
ClpAP  and  ClpXP  proteases  appear  to  be  quite  similar  to  the  proteasome  in  overall 
structure  based  on  electron  microscopy  (117),  the  crystal  structure  of  ClpP  revealed  a 
unique  "hatchet-like"  fold  of  its  subunits  (273,  274),  unlike  the  Ntn-hydrolase  p- 
sandwich  fold  of  the  20S  proteasome  subunits.  The  AAA  components  ClpA  or  ClpX 
provide  substrate  specificity  to  the  ClpP  proteolytic  complex.  The  ClpA  ATPase  can 
either  target  the  substrate  RepA  for  degradation  by  ClpP  or  act  alone  as  a  chaperone  to 
disassemble  RepA  dimers  into  monomers  (284).  The  ClpX  ATPase  acts  to  disassemble 
and/or  degrade  bacteriophage  initiation  protein  ID  aggregates  (278,  288)  and  MuA-DNA 
tetramers  (138).  Assembly  of  both  ClpA  and  ClpX  with  ClpP  requires  ATP  binding  but 
not  hydrolysis  (100,  230).  When  wild-type  ClpA  is  purified  from  E.  coli,  it  forms  a 
homo-hexameric  complex  composed  of  84-kDa  subunits  (104).  However,  when  ClpA 
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was  overproduced  using  a  plasmid  in  E.  coli,  both  full-length  (ClpA84)  and  an  N- 
terminally  truncated  form  (ClpA65)  were  produced  due  to  an  internal  translation 
initiation  site,  and  a  hexameric  complex  containing  both  proteins  was  purified  (217). 
When  a  mutant  strain  was  constructed  to  produce  the  truncated  ClpA65  protein  alone,  a 
hexameric  complex  still  formed.  However,  this  complex  had  only  5%  of  the  ATPase 
activity  of  a  ClpA84  complex  and  in  addition  could  not  support  ATP-dependent 
degradation  of  casein.  A  complex  formed  of  the  full-length  ClpA84  alone  could  support 
2.5-fold  higher  ATP-dependent  casein  degradation  than  the  ClpA84/65  mixed  complex 
(217). 

The  HslUV  (ClpYQ)  protease.  The  proteolytic  component  in  E.  coli  with  the  most 
similarity  to  the  proteasome  is  the  HslUV  (ClpYQ)  complex  (201),  which  has  been  called 
both  the  "prokaryotic  proteasome"  and  a  "hybrid"  of  Clp  and  the  proteasome  (19).  The 
subunits  of  the  proteolytically  active  HslV  (ClpY)  component  have  some  sequence 
identity  (-20%)  to  the  P  subunits  of  archaeal  and  eukaryal  proteasomes  (201,  202,  295), 
with  the  same  p-sandwich  crystallographic  fold  (19)  and  N-terminal  threonine  active  site 
(225).  Interestingly,  there  has  been  one  HslV  homolog  (with  52%  identity  to  the  E.  coli 
HslV)  identified  in  Bacillus  subtilus  which  uses  an  N-terminal  serine  residue  as  its 
catalytic  nucleophile  (110).  In  contrast  to  the  proteasome,  the  HslV  protease  forms  a 
double  ring  with  six-fold  rather  than  seven- fold  symmetry,  while  the  AAA  component 
HslU  (ClpQ)  also  forms  a  six-membered  ring  (118).  There  has  been  no  protein  identified 
which  serves  the  function  of  the  a  subunits  in  the  20S  proteasome.  However,  the  HslV 
complex  is  not  proteolytically  active  unless  it  has  formed  a  complex  with  the  ATPase 
component  HslU  (295),  and  the  opening  into  the  catalytic  chamber  formed  by  the  ring  of 
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HsIV  subunits  is  quite  narrow  (19A).  The  crystal  structure  of  HslV  has  been  solved  both 
alone  (19)  and  in  complex  with  the  AAA  regulatory  component  HslU  (233).  This  was  the 
first  solution  of  a  crystal  structure  of  both  components  of  a  two  component  ATP- 
dependent  protease  while  in  complex  together.  The  authors  confirmed  that  the  complex 
was  enzymatically  active  by  assaying  the  crystals  for  peptidase  activity  and  by  using 
small  angle  X-ray  scattering  to  solve  the  quaternary  structure  of  HslUV  in  solution  under 
conditions  where  the  complex  was  active  (233).  The  HslU  component  contains  two  AAA 
domains,  one  in  the  N-terminal  region  and  one  in  the  C-terminal  region.  Both  were  quite 
similar  to  the  D2  ATPase  domain  of  the  NSF  protein  (see  above)  (233).  The  HslU 
complex  also  contains  a  novel  feature  between  the  N-  and  C-terminal  AAA  modules, 
which  has  been  termed  the  "intermediate  domain."  This  domain  protrudes  outward  from 
the  HslU  hexamer  away  from  the  HslUV  complex  (233)  and  most  likely  has  a  role  in 
substrate  selection  and  binding.  When  the  HslU  and  HslV  complexes  bind  to  one  another 
in  the  presence  of  ATP,  several  conformational  changes  occur.  The  N-  and  C-terminal 
AAA  modules  of  the  HslU  complex  form  a  cleft  which  is  filled  by  the  N-terminal  apical 
helical  region  of  the  HslV  subunits.  In  addition,  the  C-terminal  14  residues  of  each  HslU 
subunit  form  a  well-defined  a  helix  which  extends  to  bind  between  subunits  of  the  HslV 
protease  to  form  the  active  HslUV  complex  (233). 

The  Lon  protease.  Unlike  the  26S  proteasome  and  the  Clp  family  of  proteins,  the 
Lon  and  FtsH/HflB  proteases  contain  the  ATPase  and  proteolytic  components  on  a  single 
subunit  (35).  Lon  is  a  serine  protease  which  was  first  purified  from  E.  coli  as  a  complex 
of  four  identical  subunits(35)  but  has  recently  been  purified  from  yeast  mitochondria  as  a 
seven-membered  ring  complex  (237).  Each  subunit  contains  an  ATPase  domain  near  the 
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middle  of  the  sequence  and  an  active-site  serine  near  the  C-terminus  of  the  polypeptide 
(reviewed  in  239).  Interestingly,  the  ATPase  and  protease  domains  of  Lon  were 
synthesized  separately  in  the  yeast  mitochondria  and  were  found  to  still  interact  with  one 
another  to  perform  the  same  functions  as  the  native  protein  (263).  Substrates  of  Lon 
include  transcriptional  activators,  cell  division  proteins,  and  mediators  of  the  stress 
response.  The  lon  mutants  are  deficient  in  septum  formation  and  FtsZ  ring  formation 
(reviewed  in  239).  The  Lon  protease  is  highly  conserved  in  all  three  domains  of  life,  and 
yeast  lon  mutants  can  be  complemented  by  overproduction  of  the  E.  coli  Lon  protein 
(248). 

FtsH  (HflB)  proteases.  The  FtsH  (HflB)  protein,  a  zinc  metalloprotease,  contains 
two  membrane-spanning  domains  near  the  N-terminus,  followed  by  an  AAA  domain  and 
a  Zn  -dependent  proteolytic-active  site  (HEXXH,  where  X  is  any  uncharged  amino  acid) 
in  the  cytoplasm  (reviewed  in  210).  Recombinant  FtsH  assembles  into  homo-multimeric 
rings  based  on  electron  microscopy  (227),  and  this  oligomerization  is  dependent  on  the 
N-terminal  membrane-spanning  region  (5).  There  is  also  evidence  that  the  cytoplasmic 
ATPase  domains  self-interact  and  that  this  association  is  necessary  for  degradation  of 
soluble  proteins  (2).  Furthermore,  FtsH  can  form  a  large  multi-protein  complex  in  the 
membrane  with  several  other  proteins,  including  HflKC  and  YccA  (119,  120).  The  FtsH 
protease  has  been  shown  to  recognize  substrates  based  on  their  C-terminal  sequences 
(93).  These  substrates  include  both  the  membrane-bound  SecY  translocase  (4)  and 
subunit  a  of  the  Fq- ATPase  (3)  as  well  as  the  soluble  heat-shock  factor  ct^'^  (109,  254). 
Overproduction  of  FtsH  can  complement  clpP  mutants  in  E.  coli  (93),  and  while  ftsH 
knockout  mutants  are  not  able  to  grow,y?5// temperature-sensitive  mutants  can  be 
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partially  suppressed  by  overproduction  of  the  chaperones  GroEL/GroES  (Hsp60/Hspl0) 
and  Hsp90  (226).  These  results  indicate  possible  overlapping  functions  between  proteases 
(ClpP  and  FtsH)  and  molecular  chaperones. 

Substrate  Targeting  and  Degradation 

One  common  theme  throughout  all  three  domains  of  life  is  that  the  primary 
sequence  of  a  protein,  especially  sequences  near  the  N-  and/or  C-termini,  ultimately 
determine  where  a  protein  will  be  localized  in  the  cell,  how  it  will  be  processed  to 
become  fully  active,  and  when  (and  how)  it  will  ultimately  be  degraded.  In  addition,  if  a 
protein  is  unstable  or  partially  unfolded,  sequences  on  the  protein  which  would  normally 
be  buried  {e.g.,  hydrophobic  residues)  become  exposed  to  the  environment.  These  can 
also  serve  as  signals  for  either  chaperones  or  proteases.  In  addition,  the  environment  of 
the  cell  {e.g.,  heat-shock  conditions,  oxidative  stress)  will  affect  the  thermodynamic 
stability  of  cytosolic  proteins  (192). 

Ubiquitination.  In  eukaryotic  cells,  substrate  recognition  in  the  cytosol  is  usually 
at  the  level  of  ubiquitin  conjugation,  where  numerous  proteins  are  involved  in  targeting 
substrates  to  the  26S  proteasome  (reviewed  in  36,  94,  95).  Ubiquitin  is  a  76  amino-acid 
protein  which,  when  conjugated  to  a  substrate  protein,  targets  it  for  degradation  by  the 
26S  proteasome  complex.  First,  ubiquitin  is  activated  and  linked  at  its  C-terminus  to  a 
ubiquitin-activating  enzyme  (El)  in  an  energy-dependent  reaction  via  a  ubiquitin- AMP  ' 
intermediate.  The  ubiquitin  is  then  transferred  to  a  thiol  group  on  a  ubiquitin-conjugating 
enzyme  (E2).  The  E2  enzyme  will  then  bind  to  a  ubiquitin-protein  ligase  (E3),  which  is 
directly  bound  to  a  specific  substrate  protein.  The  carboxy-terminus  of  ubiquitin  will  then 
be  directly  linked  to  the  s-amino  group  of  a  lysine  residue  within  the  target.  After  the  first 
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ubiquitin  is  attached,  further  rounds  of  ubiquitination  can  then  occur  by  the  same 
mechanism,  with  the  incoming  ubiquitin  being  attached  via  its  carboxyl  group  to 
(primarily)  lysine  48  of  the  previously-attached  ubiquitin  molecule.  A  fourth 
ubiquitination  conjugation  factor,  E4,  has  been  shown  to  be  essential  in  recognition  of 
ubiquitinated  proteins  and  in  mediating  further  rounds  of  ubiquitination  (133).  After  a 
substrate  is  ubiquitinated  with  a  minimum  of  four  ubiquitin  molecules  (252),  it  is  then 
recognized  by  subunits  in  the  Lid  domain  of  the  19S  cap  of  the  26S  proteasome,  such  as 
RpnlO  (44,  264).  The  ubiquitin  chains  are  then  removed  by  deubiquitinating  (DUB) 
enzymes  (such  as  Doa4/Ubp4)  which  are  also  in  the  Lid  of  the  RP  (182),  and  isopeptidase 
T/Ubpl4  (285)  before  the  protein  substrate  is  fed  into  the  proteolytic  core  of  the  complex. 
Ubiquitin  is  a  very  stable  protein  that  is  removed  intact  from  the  substrate  protein  and  can 
then  re-enter  the  ubiquitination  cycle.  While  most  organisms  which  have  been  analyzed 
contain  only  one  or  two  El  ubiquitin-activating  enzymes,  they  contain  numerous  E2,  E3, 
and  DUB  enzymes  (reviewed  in  36,  94,  95).  Unlike  E2  enzymes,  E3  proteins  which  have 
been  studied  have  no  sequence  similarity  to  one  another,  even  within  the  same  species, 
and  most  of  them  have  been  identified  by  binding  assays  of  specific  targets  in  the 
proteolytic  system.  Thus,  there  may  be  a  number  of  E3  enzymes  which  are  present  in  a 
given  organism  but  which  have  not  yet  been  identified.  It  is  likely  that  these  multiple  E2 
and  E3  enzymes  acting  in  cooperation  with  one  another  and  with  other  proteins  such  as 
molecular  chaperones  provide  both  specificity  as  well  as  the  ability  to  target  a  wide 
variety  of  substrate  proteins. 

Sequences  within  the  substrate  protein  which  will  render  it  more  susceptible  to 
recognition  by  ubiquitination  enzymes  have  also  been  identified.  The  "N-end  rule" 
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(reviewed  in  265,  266)  states  that  certain  residues  of  a  protein  at  the  N-terminus  will 
make  it  more  susceptible  to  degradation.  Replacement  of  those  residues  with  other  amino 
acids  will  protect  the  protein  from  degradation.  In  addition,  the  placement  of  lysine 
residues  with  regards  to  the  N-terminal  residue  can  also  play  a  role  in  whether  or  not  the 
protein  will  be  ubiquitinated.  Certain  residues  on  the  C  terminus  have  also  been  shown  to 
render  a  protein  more  susceptible  to  degradation  (115, 147).  Phosphorylation  is  an 
additional  mechanism  which  can  render  a  substrate  either  more  or  less  accessible  to 
ubiquitination  proteins  (32,  33). 

Other  eukaryal  degradation  signals.  While  ubiquitination  is  understood  to  be  the 
primary  signal  for  substrate  recognition  in  eukaryotes,  ubiquitination  is  not  necessary  for 
all  proteins  to  be  targeted  to  the  proteasome,  including  ornithine  decarboxylase  (148)  and 
cyclin-dependent  kinase  inhibitor  p21  (p2l'"'''')  (221).  The  exact  nature  of  the  signaling 
involved  in  these  and  other  substrates  is  becoming  more  clear,  as  there  is  evidence  that 
PEST  sequences  on  the  substrates  themselves  may  be  involved  (reviewed  in  195,  268). 
These  PEST  sequences  are  regions  enriched  in  proline,  glutamate,  serine,  and  threonine 
residues,  and  uninterrupted  by  positively  charged  residues.  They  are  frequently  found  in 
rapidly  degraded  and  highly  regulated  proteins  in  the  eukaryotic  cell  and  have  been 
recognized  as  degradation  signals  (200).  However,  the  mechanism  for  recognition  of 
PEST  sequences  by  the  proteolytic  machinery  is  not  very  well  understood.  For  ornithine 
decarboxylase  (ODC),  the  PEST  sequence  is  found  near  the  C-terminus  and,  in 
combination  with  a  second  protein  called  antizyme,  helps  to  target  ODC  to  the  26S 
proteasome  (69,  70,  148).  Antizyme  can  directly  target  ODC  to  the  26S  proteasome, 
bypassing  the  ubiquitin  system.  Similar  to  ubiquitin,  antizyme  is  released  intact  from  the 


26S  complex,  allowing  one  antizyme  molecule  to  participate  in  multiple  rovmds  of  ODC 
degradation  (149).  Cyclin-dependent  kinase,  on  the  other  hand,  is  targeted  to  the 
proteasome  only  when  the  tetrameric  complex,  composed  of  two  regulatory  (R)  and  two 
catalytic  (C)  subunits,  has  dissociated  into  monomers.  Both  R  and  C  subunits  contain 
PEST  sequences,  but  these  sequences  appear  to  be  buried  in  the  tetramer  and  are  only 
exposed  after  cAMP-dependent  dissociation  of  the  complex  (21).  In  some  cases,  PEST 
sequences  can  also  act  in  cooperation  with  the  ubiquitin  system.  The  transcriptional 
inhibitor  iKBa,  which  contains  a  C-terminal  PEST  sequence,  is  rapidly  degraded  when 
dissociated  fi'om  the  transcriptional  activator  NF-kB,  and  deletion  of  the  PEST- 
containing  C-terminal  residues  stabilizes  the  iKBa  protein  (32).  In  addition,  there  are 
certain  cytokines  which  can  target  iKBa  for  degradation  while  still  in  association  with 
NF-kB  (33).  In  this  case,  PEST  sequences  act  prior  to  ubiquitination  by  serving  as  a 
signal  in  the  phoshorylation  of  two  specific  serine  residues  in  the  iKBa  protein  (33).  The 
phosphorylated  iKBa  protein  is  then  recognized  by  ubiquitination  enzymes  or  other 
members  of  the  proteolytic  pathway  (195).  Interestingly,  while  some  proteins  do  not 
appear  to  require  ubiquitination  for  degradation,  some  proteins  can  be  labeled  with 
ubiquitin  and  still  not  be  degraded  by  the  26S  proteasome  (105, 129).  So,  while  much 
progress  has  been  made  in  understanding  the  mechanisms  of  substrate  labeling  and 
degradation  in  eukaryotes,  there  are  still  many  questions  to  be  answered  and  much  to  be 
learned  about  this  complex  process. 

Eubacterial  degradation  signals.  While  the  ubiquitin  system  is  conserved  in  all 
eukaryotes  examined,  neither  ubiquitin  nor  the  enzymes  of  the  ubiquitination  pathway 
have  been  found  in  any  archaeal  or  eubacterial  cells.  No  gene  with  significant  identity  to 
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ubiquitin  has  been  discovered  in  any  of  the  prokaryotic  or  archaeal  genomes  which  have 
been  completely  sequenced.  In  prokaryotic  organisms,  the  model  is  that  proteins  are 
generally  directly  targeted  to  the  ATPase  subunits  or  domains  of  the  protease  by 
sequences  inherent  in  the  proteins  themselves. 

Proteins  which  recognize  and  degrade  N-end  rule  substrates  have  also  been 
identified  in  eubacteria  (253).  Based  on  genetic  screening  of  E.  coli  mutants  in  which  N- 
end  rule  substrates  were  stabilized,  ClpAP  (but  not  ClpXP)  has  been  implicated  in  this 
pathway  (228).  In  contrast  to  eukaryotes,  an  intermediary  protein  that  directly  recognizes 
the  N-terminal  residue  of  N-end  rule  substrates  and  targets  the  substrate  to  ClpA  has  not 
been  identified  in  any  prokaryotes.  Instead,  it  has  been  proposed  that  substrate 
recognition  may  be  at  the  level  of  the  ATPase  component  of  the  protease  itself  (reviewed 
in  94).  Therefore,  these  ATPases  are  likely  to  not  only  unfold  and  feed  substrates  into  the 
proteolytic  core  (similar  to  the  Rpt  subunits  of  the  RP),  but  they  may  also  directly 
recognize  substrate  proteins  (similar  to  the  Rpn  ubiquitin-recognition  subunits  of  the  RP). 

Another  mechanism  for  targeting  proteins  for  degradation  in  eubacteria  is  the 
SsrA-tagging  system.  The  lOSaRNA  transcript  of  the  ssrA  gene  can  add  a  "degradation 
sequence"  to  an  RNA  transcript  without  a  stop  codon  or  to  one  that  is  stalled  on  the 
ribosome.  The  ssrA  transcript  is  a  363nt  RNA  unit  which  has  been  termed  "tmRNA" 
because  it  serves  the  function  of  both  tRNA  and  mRNA  (1 16, 256).  The  tRNA  portion  of 
the  tmRNA  is  charged  with  alanine  and  can  bind  to  the  P  site  of  a  stalled  ribosome. 
Translation  then  switches  to  the  mRNA-like  portion  of  the  tmRNA  that  encodes  a  1 0- 
residue  peptide  tag.  Thus,  the  final  protein  has  the  sequence  AANDENYALAA  added 
co-translationally  at  the  C-terminus.  This  peptide  sequence  targets  the  protein  for 
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degradation  by  cellular  proteases,  such  as  ClpAP,  ClpXP,  or  FtsH  (79,  93).  Interestingly, 
ClpAP  and  ClpXP  have  been  observed  to  recognize  both  the  N-  and  C-termini  of 
substrates  (77, 138),  indicating  that  one  regulatory  ATPase  can  recognize  several 
different  types  of  recognition  signals.  In  addition,  other  proteins  have  been  shown  to  be 
involved  in  enhancing  the  specificity  of  the  ATPase  component  for  a  tagged  substrate, 
including  SspB.  The  SspB  protein  is  a  ribosome-associated  protein  which  binds 
specifically  to  SsrA-tagged  proteins,  targeting  them  specifically  to  the  ClpX  ATPase 
(146).  Even  though  both  ClpA  and  ClpX  can  recognize  SsrA-tagged  substrates  in  vitro, 
there  is  evidence  that  ClpX  recognizes  the  majority  of  these  proteins  in  vivo  (79).  The 
sspB  gene  is  part  of  an  operon  induced  by  starvation,  which  is  a  time  of  stress  when  more 
polypeptide  chains  are  stalled  on  the  ribosome.  The  upregulation  of  sspB  allows  for  more 
specific  control  of  which  proteins  are  targeted  for  proteolysis  than  there  would  be  if  only 
ClpX  itself  was  upregulated.  Specificity  factors,  such  as  SspB,  help  to  explain  how  a 
single  ATPase  can  recognize  a  diverse  number  of  substrates  and  also  how  different 
cellular  environmental  conditions  will  cause  specific  proteins  to  be  targeted  for  unfolding 
and  degradation. 

Substrate  unfolding  and  degradation.  For  many  of  the  energy-dependent  protease 
components,  including  the  26S  proteasome,  ClpAP,  and  ClpXP  there  is  an  asymmetrical 
mismatch  between  the  ATPase  component  (with  a  six-membered  ring)  and  the  protease 
component  (with  a  seven-membered  ring).  It  is  thought  that  ATP  hydrolysis  by  the 
regulatory  component  causes  rotation  of  the  ATPase,  allowing  the  substrate  to  be 
threaded  into  the  proteolytic  core  of  the  complex  (18,  109).  Degradation  of  protein 
substrates  by  either  ClpAP  or  ClpXP  is  a  highly  processive  process  (251);  that  is,  a 
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substrate  is  completely  degraded  and  released  before  another  substrate  molecule  is  taken 
up  by  the  protease.  Studies  indicate  that  protein  degradation  by  the  archaeal  20S 
proteasome  from  T.  acidophilum  is  also  very  processive  (6). 

While  degradation  by  the  eukaryotic  20S  proteasome  has  also  been  demonstrated 
to  be  very  processive  (126),  there  are  a  few  notable  exceptions.  For  example,  the 
transcriptional  activator  NF-kB  is  a  heterodimeric  protein  consisting  of  a  p65  and  a  p50 
subunit.  The  p50  subunit  is  synthesized  as  a  105-kDa  precursor  (pi 05),  and  the  26S 
proteasome  is  responsible  for  degradation  of  the  C-terminus  of  pi  05  to  yield  the  mature 
p50  protein  (180).  Interestingly,  the  C  termini  of  pi  05  and  of  iKBa  are  very  similar.  Both 
contain  PEST  sequences  which  lead  to  phosphorylation  of  the  protein  at  specific  serine 
residues  (180).  However,  the  mechanism  by  which  the  proteasome  endoproteolytically 
cleaves  pi 05  into  two  fragments  and  then  specifically  degrades  only  the  C-terminal 
region  is  unknown. 

Other  Proteases 

Other  proteases  have  been  isolated  which  can  partially  compensate  for  loss  of 
proteasome  function  in  mammalian  cells  including  the  subtilisin-like  serine  protease 
tripeptidyl  peptidase  II  (TPPII)  (64).  The  ability  to  compensate  for  proteasome  function 
was  first  observed  in  an  outgrowth  of  mouse  lymphoma  cells  which  had  adapted  to 
proteasome  vinyl  sulphone  inhibitors  (71).  The  TPPII  protein,  which  had  been  previously 
isolated  and  characterized  (12),  was  later  shown  to  be  responsible  for  this  adaptation  (64). 
The  TPPII  protein  forms  a  rod-shaped  structure  when  observed  using  electron 
microscopy  and  is  somewhat  larger  than  the  26S  proteasome  with  dimensions  of  ~50nm 
by  17  nm  (64).  It  is  an  exopeptidase  which  requires  the  N-terminus  of  the  protein 
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substrate  to  be  exposed,  and  it  releases  multiples  of  three  residues  after  cleavage  (12). 
While  there  is  little  sequence  or  structural  similarity,  it  has  been  hypothesized  that  this 
protease  may  be  the  functional  mammalian  homolog  of  the  archaeal  tricom  protease  (64); 
that  is,  it  may  further  degrade  oligopeptides  after  release  from  the  proteasome  (discussed 
below).  It  also  has  some  sequence  identity  to  the  archaeal  serine  protease  pyrolysin 
(discussed  below),  which  was  first  isolated  from  the  hyperthermophilic  archaeon 
Pyrococcus  furiosus  (270).  However,  pyrolysin  is  an  endoprotease  rather  than  an 
exoprotease. 

Archaeal  Proteases  and  Chaperones 

The  complete  genomic  sequences  of  nine  euryarchaea  have  been  published:  M 
jannaschi  (26),  a  hyperthermophilic  methanogen  with  an  optimal  growth  temperature  of 
82°  C,  originally  isolated  from  a  deep-sea  hydrothermal  vent  (108);  Archaeoglobus 
fulgidus  (128);  Methanobacterium  thermoautotrophicum  AH  (231);  Pyrococcus  abyssi 
(92);  Pyrococcus  horikoshii  0T3  (113);  Thermoplasma  acidophilum  (113,  204); 
Halobacterium  sp.  NRC-1  (171);  and  most  recently  Thermoplasma  volcanium  GSSl 
(1 14).  The  genomes  of  crcnaichaea  Aeropyrum  pernix  Kl  (1 12)  and  Sulfolobus 
solfataricus  (220)  have  also  been  sequenced.  In  addition,  the  sequence  of  M  maripaludis 
strain  LL  has  recently  been  completed  (J.  Leigh,  personal  communication). 

Archaeal  proteasomes.  Each  archaeal  genome  has  ORFs  encoding  a  and  P 
subunits  of  the  20S  proteasome,  which  are  unlinked  on  the  chromosome  (See  Table  1). 
Based  on  genome  sequencing,  both  Pyrococcus  species  as  well  as  Aeropyrum  pernix 
have  two  P  subunit  paralogs.  Haloferax  volcanii  is  the  only  organism  at  this  time  which 
has  been  shown  to  have  two  a  subunit  paralogs  (286);  even  the  completely  sequenced 
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Halobacterium  sp.  NRC-1  appears  to  have  only  one  a  subunit  (171).  All  Archaea 
examined,  with  the  exception  of  the  euryarchaea  T.  acidophilum  (204)  and  T.  volcanium 
(114),  also  contain  at  least  one  AAA-ATPase  with  high  identity  to  regulatory  subunit 
Rpt2p  (human  S4)  of  the  eukaryotic  26S  proteasome.  The  Rpt2p  homolog  of  M 
jannaschii  has  been  purified  and  characterized  (303).  The  authors  called  the  protein  PAN 
for  Proteasome  Activating  Nucleotidase.  This  protein  could  activate  the  ATP-dependent 
protein  hydrolyzing  activity  of  both  T.  acidophilum  and  M.  thermophila  20S 
proteasomes.  It  is  notable  that  H.  volcanii,  the  only  organism  that  has  been  shown  to 
contain  two  a  subunits  of  the  20S  proteasome,  also  has  two  PAN  paralogs  (J.  A.  Maupin- 
Furlow,  unpublished  results). 

One  especially  intriguing  difference  between  eukaryotes  and  the  archaea  is  that, 
even  though  archaea  contain  the  core  components  of  the  26S  system  (the  20S  proteasome 
and  at  least  one  homolog  of  the  ATPase  subunits  of  the  1 9S  cap),  no  homolog  of  the 
subunits  involved  in  recognition  and  removal  of  ubiquitin  has  been  identified.  In  addition, 
none  of  the  ubiquitin  activation,  conjugation,  or  ligation  (El,  E2,  and  E3)  enzymes  have 
been  identified.  Furthermore,  a  gene  encoding  ubiquitin  itself  is  not  predicted  to  be 
present  in  any  of  the  archaeal  genomes  that  have  been  sequenced.  However,  it  is  likely 
that  there  is  an  as  yet  unidentified  system  that  specifically  targets  substrates  to  ATP- 
dependent  proteases  in  Archaea.  For  example,  a  small  protein  (similar  to  ubiquitin)  or 
peptide  (similar  to  SsrA)  tags  larger  substrates  and  targets  them  for  degradation  by  the 
ATP-dependent  proteasome. 

Lon  and  FtsH  protease  homologs.  The  Lon  protease  is  highly  conserved 
throughout  all  three  domains  of  life  and  is  predicted  to  be  present  in  all  archaeal  genomes 
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which  have  been  sequenced.  The  proteolytic  serine  residue  active  site  as  well  as  residues 
involved  in  ATPase  activity  are  well-conserved  based  on  multiple-sequence  alignments. 
Interestingly,  the  archaeal  Lon  homologs  all  have  putative  N-terminal  membrane- 
spanning  regions,  unlike  the  eubacterial  and  eukaryal  homologs.  This  suggests  that  they 
are  associated  with  the  membrane  in  the  archaeal  cell  (158).  The  function  of  Lon  in  the 
archaeal  cytosol  is  as  yet  unknown.  An  ORF  has  also  been  identified  in  archaea  which 
was  originally  identified  as  the  Rpt6p  (human  S8)  ATPase  regulatory  subunit  of  the  26S 
proteasome  but  which  also  has  some  identity  to  the  FtsH  protease.  The  FtsH  homolog  in 
the  archaea  does  not  appear  to  have  the  Zn  -dependent  active  site  or  a  membrane- 
spaiming  region.  So,  in  contrast  to  archaeal  Lon,  the  FtsH  homolog  is  most  likely  soluble. 
It  has  been  proposed  that  the  membrane-bound  archaeal  Lon  protease  may  substitute  for 
FtsH  function  in  the  Archaea  (204),  and  that  the  FtsH/Rpt6p  homolog  is  not  a  protease 
but  rather  a  molecular  unfoldase  or  chaperone  involved  in  protein  degradation  or  folding 
(158). 

The  tricom  protease.  One  other  novel  protease,  the  tricom  protease,  has  thus  far 
been  studied  only  in  the  thermophilic  archaeon  T.  acidophilum  (244).  The  tricom 
protease  is  a  large  macromolecular  assembly  consisting  of  121-kDa  monomers  which 
associate  into  hexamers,  20  of  which  can  then  finally  assemble  into  a  large  icosahedral 
capsid  of  14.6  MDa  (242,  272).  The  capsid  serves  as  an  "organizing  center"  for  three 
interacting  factors,  including  a  proline  iminopeptidase  (245)  and  two  amino-peptidases 
(242).  The  tricom  protease  core  assembly  can  degrade  oligopeptides  of  6-12  residues 
processively  from  the  N-terminus  into  2-4  residue  peptides,  and  the  interacting  factors 
further  degrade  these  peptides  into  single  amino  acids  (242).  Furthermore,  the  tricom 


protease  is  an  exopeptidase  which  requires  the  N-terminus  of  the  substrate  to  be  exposed. 
Based  on  the  size  of  the  substrates  degraded,  it  is  thought  that  the  tricom  protease 
operates  downstream  from  the  proteasome  in  the  T.  acidophilum  cytosol  to  further 
degrade  peptides  into  amino  acids.  In  addition,  the  entire  proteolytic  pathway  has  been 
reconstituted  in  vitro  using  recombinant  20S  proteasome,  tricom  protease,  and  the 
interacting  factors  (242).  Based  on  sequence  analysis,  Sulfolobus  solfataricus  and  T. 
volcanium  both  contain  a  tricom  protease  homolog  as  well  as  the  interacting  factors. 
However,  the  Tricom  protease  core  has  no  sequence  identity  to  any  ORFs  from  other 
sequenced  archaeal  genomes,  though  both  the  proline  iminopeptidase  and  the  amino- 
peptidase  components  share  sequence  identity  to  other  zinc-dependent  amino-peptidases 
from  all  three  domains  of  life  (242). 

Other  Archaeal  proteases.  The  Pfpl  protein  is  a  novel  archaeal  intracellular 
protease  which  was  first  purified  from  P.  furiosus  (88)  and  which  also  appears  to  be 
conserved  in  all  other  completely  sequenced  archaea  with  the  exception  of  M 
thermoautotrophicum.  In  addition,  the  Clp  system  does  not  appear  to  be  in  any  of  the 
archaea  which  have  been  sequenced,  with  the  exception  of  one  gene  from  M. 
thermoautotrophicum  which  has  some  sequence  similarity  to  Clp  (HsplOO)  ATPase 
genes  from  eubacterial  organisms.  This  protein  may  function  solely  as  a  chaperone 
without  a  protease  component  (similar  to  ClpB)  or  may  perhaps  even  cooperate  with 
other  chaperone  systems,  as  CIpB  does  with  the  DnaK/DnaJ/GrpE  proteins  in  E.  coli 
(179).  However,  in  contrast  to  other  Clp  proteins,  the  M.  thermoautotrophicum  Clp 
homolog  appears  to  be  a  Type  I  AAA-ATPase  with  only  one  AAA  module.  None  of  the 


other  Archaea  that  have  been  completely  sequenced  appear  to  contain  this  protein, 
including  the  mesophilic  M  maripaludis. 

Extracellular  archaeal  proteases.  Many  archaeal  organisms,  especially 
hyperthermophiles  and  halophiles,  grow  very  well  on  protein  substrates  (270). 
Extracellular  proteases  are  necessary  for  the  organism  to  degrade  and  utilize  these  protein 
substrates.  Numerous  extracellular  proteases  have  been  isolated  from  hyperthemophiles, 
including  pyrolysin  from  P.  furiosus  (270),  a  subtilisin-like  serine  protease  with  high 
identity  to  eukaryal  tripeptidyl  peptidases  II  (TPPII).  However,  in  contrast  to  eukaryal 
TPPII,  pyrolysin  is  an  endopeptidase  rather  than  an  exopeptidase  (270). 

Archaeal  chaperones.  Archaeal  organisms  reside  in  a  wide  variety  of 
envirormients;  therefore,  their  mechanisms  for  dealing  with  stress  are  of  special  interest 
(reviewed  in  154, 158).  Archaeal  cells  do  not  appear  to  have  the  eubacterial 
GroEL(Hsp60)/GroES(Hspl0)  chaperone  system,  based  on  genome  sequence  analyses 
(Table  2).  The  most  well-conserved  chaperonin  among  the  Archaea  is  the  thermosome 
(Hsp60),  which  has  high  sequence  identity  to  the  TriC(CCT)  eukaryotic  chaperonin 
(reviewed  in  86,  130).  In  eukaryotic  cells,  the  TriC  chaperonin  is  a  double  ring  complex 
consisting  of  eight  different  subunits,  each  in  a  defined  position  within  the  ring  (150). 
The  archaeal  thermosome,  which  was  first  purified  from  Pyrodictum  occultum  cells 
exposed  to  heat  shock  (190),  is  also  composed  of  either  eight  or  nine-membered  double 
rings.  In  contrast  to  the  eukaryotic  counterpart,  the  thermosome  complexes  purified  from 
P.  occultum,  Sulfolobus  shibatae, (255),  Thermococcus  sp.  KS-1  (296),  and  T. 
acidophilum{\H)  consist  of  only  two  types  of  subunits,  alternating  a  and  (3  subunits 
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arranged  in  an  eight-  or  nine-membered  ring.  Sequences  from  H.  volcanii  (137),  A. 
fulgidus  (85),  M.  thermoautotrophicum  (1 12),  T.  volcanium,  (1 14),  and  A.  pernix  (1 12) 
indicate  that  these  organisms  also  contain  two  thermosome  subunits.  On  the  other  hand, 
some  of  the  hyperthermophilic  Archaea  including  M.  jannaschi  (136)  and  Pyrococcus  sp. 
(293)  have  only  one  gene  with  high  identity  to  other  archaeal  thermosome  subunit  genes 
(Table  2).  The  eukaryal  TriC/CCT  mediates  proper  folding  of  proteins  in  the  eukaryal 
cytosol,  including  the  cytoskeletal  proteins  actin  (61)  and  tubulin  (292).  The  thermosome 
most  likely  serves  a  similar  role  in  archaeal  cells.  In  addition,  the  thermosome  appears  to 
be  the  predominant  heat-shock  protein  in  the  cytosol  of  the  hyperthermophilic  archaeon 
P.  occultum  (191).  The  mechanism  for  both  the  archaeal  thermosome  and  the  eukaryotic 
TriC(CCT)  does  not  appear  to  be  dependent  on  a  separate  chaperonin  component  such  as 
GroES.  Based  on  crystal  structures  of  the  T.  acidophilum  thermosome  (45,  131),  it  has 
been  proposed  that  an  N-terminal  protusion  in  the  substrate-binding  apical  region  of  the 
thermosome  serves  as  the  flinctional  equivalent  of  GroES.  These  chaperones  have  thus 
been  named  Group  II  chaperonins  to  distinguish  them  from  Group  I  chaperonins  such  as 
GroEL  which  require  a  separate  co-chaperonin  for  activity.  However,  there  is  some 
evidence  that  there  may  be  a  component  which  interacts  with  TriC.  This  protein,  which  is 
called  prefoldin  (262)  or  GimC  (65),  has  been  shown  to  be  able  to  bind  actin  and  transfer 
it  to  the  TriC  complex.  Prefoldin  is  also  retained  on  a  TriC-ATP-agarose  column  (262). 
The  GimC/prefoldin  complex  forms  a  hexameric  ring  of  six  different  paralogous  subunits 
in  the  eukaryal  cytosol  (262),  but  it  is  not  known  exactly  how  this  would  interact  with  the 
octameric  TriC  compelex.  Homologs  of  GimC/prefoldin  are  conserved  in  the  archaea  and 
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eukarya,  while  they  are  absent  from  the  eubacteria.  Archaea  generally  have  one  or  two 
GimC/prefoldin  paralogs  (Table  2),  while  eukarya  contain  six. 

The  Hsp70  system,  consisting  of  the  DnaK  protein  as  well  as  the  cochaperones 
DnaJ  (Hsp40),  and  GrpE,  is  very  well  conserved  in  both  eukaryotes  and  eubacteria 
(reviewed  in  154).  Surprisingly,  it  is  not  as  well  conserved  in  the  archaea.  It  is  also 
intriguing  that  the  archaeal  DnaK/DnaJ/GrpE  proteins  are  more  similar  at  the  amino  acid 
level  to  eubacterial  than  to  eukaryal  chaperones.  In  addition,  the  archaeal  genes  are 
usually  found  in  an  operon  in  the  order  5' -grpE-dnaK-dnaJ-3\  which  is  identical  to  the 
order  found  in  eubacteria;  however,  the  intergenic  promoter  regions  and  transcription 
start  sites  are  eukaryotic-like  in  nature  (reviewed  in  154).  The  DnaK/DnaJ/GrpE  system 
appears  to  be  in  place  in  Methanosarcina  species  (99),  Halobacterium  species,  M. 
thermoautotrophicum,  and  T.  acidophilum,  while  it  is  not  present  in  Methanococcus 
species  (including  M.  maripaludis),  Sulfolobus  species,  Pyrococcus  species,  or  A. 
fulgidus,  based  on  genome  sequence  analysis  (Table  2).  It  has  been  proposed  that  the 
DnaK/DnaJ/GrpE  chaperone  system  is  replaced  by  another  unidentified  system  in  the 
majority  of  hyperthermophiles  which  may  be  tailored  to  the  special  conditions  of  their 
environment  (1 14). 

The  VAT  (VCP-like  ATPase  of  T.  acidophilum)  is  an  archaeal  homolog  of  the 
eukaryotic  Cdc48  and  p97/VCP  chaperones  (181).  All  sequenced  archaeal  genomes  - 
appear  to  contain  at  least  one  or  two  homologs  of  this  protein  (Table  2).  In  addition, 
homologs  have  been  identified  and  characterized  in  the  partially  sequenced  genomes  of  5. 
acidocaldarius  (38)  and  H.  salinarium  (54).  When  produced  in  recombinant  E.  coli,  VAT 
from  T.  acidophilum  forms  single  and  double  homohexameric  rings  which  resemble 
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Cdc48  and  p97  (57,  189)  and  further  associates  to  form  four-ringed  structures  that 
resemble  the  20S  proteasome  based  on  electron  microsopy  (181,  199).  Similarly  to  NSF 
and  other  Type  II  AAA-ATPases,  the  N-terminal  domain  is  not  necessary  for 
oligomerization  of  the  complex  and  appears  to  be  quite  flexible  (199).  In  addition,  AAA 
domains  Dl  and  D2  of  VAT  and  NSF  are  quite  similar  to  one  another  structurally  and 
can  be  superimposed  (199).  In  contrast  to  NSF  and  Cdc48,  VAT  is  stable  in  the  absence 
of  ATP  (76).  Interestingly,  VAT  can  act  to  either  fold  or  unfold  a  protein  substrate  in 
vitro,  depending  on  the  concentration  of  Mg    and  ATP  (76).  It  has  been  proposed  that 
this  chaperone  (and  others)  may  help  to  compensate  for  the  lack  of  a  PAN  homolog  in  T. 
acidophilum  (76). 

Small  heat  shock  proteins  (sHSPs/Hsp20s)  are  present  in  all  archaeal  genomes 
sequenced,  and  a  sHSP  homolog  from  M  jannaschii  has  been  cloned,  purified,  and 
characterized  (122).  When  produced  in  E.  coli,  the  16.5  kDa  protein  formed  a  homo- 
oligomeric  ~15-20run  spherical  structure  with  a  molecular  mass  of  -400  kDa  (122).  In 
addition,  E.  coli  cell  extracts  containing  this  protein  were  protected  from  thermal 
denaturation  and  precipitation  when  heated  to  100°C  (122).  The  purified  protein  also 
protected  both  single-chain  monellin  and  citrate  synthase  from  aggregation  at  80°  and  40° 
C,  respectively  (122).  The  crystal  structure  of  the  M  jannaschii  sHSP  was  obtained  at 
2.9A  resolution  (121)  and  revealed  that  the  24-subunit  complex  formed  a  hollow  sphere 
of  octahedral  symmetry  with  eight  triangular  and  six  square  openings.  The  openings  were 
proposed  to  be  possible  inlets/outlets  for  substrate  proteins.  It  was  also  hypothesized  that 
substrates  could  be  "trapped"  for  folding  inside  the  hydrophobic  interior  of  the  hollow 
sphere  similar  to  the  GroEL/ES  or  thermosome  complex  (121). 
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As  can  be  seen  from  the  discussion  above,  while  there  are  many  similarities 
among  all  archaeal  organisms,  there  are  also  intriguing  differences.  These  include  the 
lack  of  a  PAN  homolog  in  some  organisms,  the  lack  of  the  DnaK/DnaJ/GrpE  chaperone 
system  in  hyperthermophiles,  the  lack  of  a  Pfpl  protease  homolog  (but  the  presence  of  a 
Clp  homolog)  in  M.  thermoautotrophicum,  and  the  presence  of  the  tricom  protease  in  two 
archaeal  species.  Some  attempts  have  been  made  to  correlate  the  presence  or  lack  of 
certain  chaperones  and  other  components  with  the  optimal  growth  temperature  of  the 
archaeaon  (114).  However,  when  halophiles  and  mesophilic  methanococci  are  also 
considered,  the  picture  becomes  much  more  complicated.  In  addition,  reconciling  the  fact 
that  archaeal  cells  contain  both  a  eukaryotic-like  chaperonin  system  (the  thermosome), 
and,  in  some  cases,  a  bacterial-like  chaperone  system  (Hsp70/Hsp40/GrpE)  has  been  the 
subject  of  much  speculation. 

Methanogens  and  Genetic  Studies  in  Methanogens 

Performing  genetic  analyses  in  methanogenic  archaea  can  be  quite  challenging, 
because  most  of  the  genetic  techniques  which  have  been  developed  for  use  in  gram- 
negative  eubacteria  cannot  be  applied  to  these  organisms.  The  strict  requirement  for 
anaerobic  conditions  as  well  as  the  fragile  S-layer  cell  envelope  of  Methanococcus  sp., 
which  lyses  in  hypotonic  conditions,  can  make  electroporation  and  CaCli-mediated 
transformation  very  inefficient.  In  addition,  archaea  have  a  different  replication 
mechanism  than  eubacteria  (29,  50)  so  broad  host-range  vectors  fail  to  replicate  in  the 
archaea.  The  RNA  polymerase  is  more  complex  than  the  eubacterial  counterpart,  and 
some  of  the  transcription  factors  such  as  TBP  (75)  and  TFIIB  are  homologous  to  the 
eukaryal  proteins.  Furthermore,  the  promoter  regions  are  eukaryotic-like  with  a  TATA 
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box  and  initiator  element,  so  eubacterial  promoters  are  not  recognized  in  the  archaeal 
cell.  Most  antibiotics  developed  for  use  in  eubacteria  fail  to  inhibit  the  growth  of  archaea 
due  to  cell  wall  differences  as  well  as  different  protein  synthesis  mechanisms  such  as 
ribosome  structure.  Other  techniques  such  as  blue/white  selection  using  the  reporter  gene 
lacZ  and  the  substrate  X-gal  must  be  modified  {e.g. ,  by  replica  plating  before  screening) 
because  the  formation  of  the  blue  color  produced  from  the  breakdown  of  X-gal  requires 
oxygen. 

Thermophilic  organisms  present  special  challenges  because  cultivation  can  be 
time  consuming  and  difficult,  especially  on  solid  media;  antibiotics  as  well  as  the  protein 
which  confers  antibiotic  resistance  may  be  unstable  at  high  temperatures;  and  unprotected 
vector  DNA  may  also  be  unstable  at  high  temperatures  (reviewed  in  174).  Auxotrophic 
mutant  screening  is  also  difficult  due  to  a  lack  of  growth  of  most  thermophilic  organisms 
on  defined  media  in  the  laboratory  (174).  Therefore,  while  some  advances  have  been 
made  in  the  hyperthermophilic  species  Sulfolobus  and  Pyrococcus  (174),  little  is  known 
about  the  metabolism  or  physiology  of  M  jannaschii  other  than  that  which  can  be 
determined  from  analysis  of  the  genome  sequence  (213).  In  contrast,  two  mesophilic 
methanogen  species,  Methanococcus  voltae  and  M  maripaludis,  have  been  studied 
extensively  using  genetic  techniques  in  recent  years.  Both  of  these  methanogens  have 
simple  protein  S-layer  cell  envelopes,  which  are  easily  disrupted  for  DNA  isolation  as 
well  as  for  spheroplast  formation.  In  addition,  these  methanogens  generally  grow  as  non- 
flagellated  single  cells,  with  little  clumping,  allowing  isolation  of  single  cells  and 
colonies  (107).  The  relatively  rapid  doubling  time  and  mesophilic  nature  of  these 


42 

microorganisms  have  made  them  popular  models  for  performing  genetic  analyses  in 
methanogens  (reviewed  in  258  and  283).  '  ; 

Genetic  studies  in  M.  voltae.  M.  voltae  was  isolated  from  the  Waccasassa  estuary 
near  Cedar  Key,  Florida  (276)  and  has  a  doubling  time  of  1 .2  hours  with  an  optimal 
growth  temperature  of  38°C  (282).  M.  voltae  was  the  first  methanogen  in  which 
transformation  was  successfully  performed.  In  this  study,  the  authors  were  able  to  rescue 
histidine  and  purine  auxotrophs  by  transformation  with  wild-type  chromosomal  DNA 
(17).  In  a  later  study,  M.  voltae  was  tranformed  with  the  integration  vectors  Mipl  and 
Mip2  (68).  These  vectors  were  derived  from  plasmid  pUC18  (containing  an  ampicillin 
resistance  marker  and  an  origin  of  replication  for  manipulations  in  E.  coli).  In  addition, 
the  vector  contained  the  pac  (puromycin  transacetylase)  gene  from  Streptomyces 
alboniger  (139)  with  promoter  and  terminator  regions  from  the  mcr  (methyl  CoM 
reductase)  operon  of  M.  voltae,  for  puromycin  resistance  selection  in  M.  voltae.  The  pac 
cassette  was  flanked  by  hisA  gene  sequences  to  promote  homologous  recombination  and 
integration  into  the  chromosome  (68).  Histidine  auxofrophs  could  then  be  isolated  and 
analyzed.  In  both  of  these  studies,  the  transformation  efficiency  was  very  low  (~10 
transformants  per  ^g  DNA).  The  reporter  genes  uidA  (P-glucoronidase  from  E.  coli)  and 
treA  (trehalose  gene  from  Bacillus  subtilis)  have  also  been  used  in  M.  voltae  to  study  the 
regulation  of  [NiFe]-hydrogenase  genes  (14,  16).  M.  voltae  contains  four  sets  of 
hydrogenase  genes  (87).  Two  sets  ifru  and  vhu)  contain  selenocysteine  and  are 
constitutively  transcribed.  The  other  two  sets  {frc  and  vhc)  contain  a  cysteine  rather  than 
a  selenocysteine  residue  at  the  homologous  position  and  are  transcribed  only  in  the 
absence  of  selenium  (15).  The  frc  and  vhc  genes  are  divergently  transcribed  from  a 
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common  453-bp  intergenic  promoter  region.  When  either  the  frc  or  the  vhc  gene  was 
replaced  with  the  reporter  gene  uidA,  P-glucuronidase  activity  could  be  measured  only  in 
cells  grown  in  the  absence  of  selenium  (14).  Furthermore,  by  placing  the  intergenic 
region  between  uidA  and  treA  in  the  same  construct,  it  could  be  shown  directly  that  these 
two  genes  are  coordinately  regulated  (16).  Finally,  by  mutating  specific  regions  of  this 
intergenic  promoter  region  in  the  above  constructs,  it  was  found  that  these  two  operons 
were  both  under  the  control  of  a  single  negative  regulatory  element  (similar  to  a 
eukaryotic  silencer)  as  well  as  activator  regions  specific  for  each  of  the  two  hydrogenase 
operons  (173). 

One  limitation  in  the  use  of  M  voltae  as  a  model  organism  to  study  methanogen 
physiology  is  that  while  it  can  synthesize  the  amino  acids  isoleucine  and  leucine,  it  has  a 
strict  growth  requirement  in  the  laboratory  for  these  amino  acids.  This  makes  screening 
for  auxotrophic  mutants  difficult  (223).  In  addition,  no  plasmid  has  been  found  in  this 
organism,  which  has  prevented  the  development  of  a  self-replicating  shuttle  vector. 
Currently,  the  transformation  efficiency  is  no  higher  than  the  frequency  of  spontaneous 
puromycin  resistant  mutants.  This  makes  statistical  analyses  of  transformants  and 
auxotrophic  mutants  difficult. 

Genetic  studies  in  M.  maripaludis.  M.  maripaludis  was  isolated  from  a  salt-water 
marsh  and  has  a  doubling  time  of  -2.3  hours  with  an  optimal  growth  temperature  of  37° 
C  (106).  In  contrast  to  M.  voltae,  M.  maripaludis  is  a  nutritionally  versatile  facultative 
autotroph.  M.  maripaludis  will  assimilate  amino  acids,  acetate,  and  formate,  but  the  cells 
can  also  grow  with  CO2  as  the  sole  carbon  source.  These  characteristics  have  made 
possible  both  auxotrophic  mutant  screening  as  well  as  labeling  studies  to  study  carbon 
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metabolism.  Before  discovery  and  development  of  any  antibiotic  resistance  genes  in 
methanogens,  acetate  auxotrophs  were  isolated  after  enrichment  using  the  base  analogs  6- 
azauracil  and  8-azahypoxanthine  (140).  This  study  provided  the  first  direct  genetic 
evidence  that  methanogenic  Archaea  contain  the  Ljungdahl-Wood  pathway  for 
autotrophic  CO2  fixation  and  acetyl-CoA  biosynthesis. 

Integrative  vectors  have  been  developed  for  use  in  M.  maripaludis  (63),  based  on 
the  original  Mip  vectors  from  M.  voltae,  by  using  a  fragment  of  M.  maripaludis  genomic 
DNA  in  place  of  the  hisA  gene  sequences  from  M.  voltae  to  allow  for  homologous 
recombination  into  the  chromosome.  An  integrative  expression  vector,  pWLG14,  was 
also  developed  by  placing  a  multiple  cloning  region  downstream  of  the  M.  voltae  histone 
promoter  (PhmvA)  which  allows  for  constitutive  overexpression  of  a  protein  of  interest 
after  integration  into  the  chromosome  (63).  A  PEG-based  transformation  method  that 
yields  10^  to  1 0^  tranformants  per  |ig  DNA  has  also  greatly  aided  genetic  studies  in  this 
and  other  methanogens  (260).  In  contrast  to  M.  voltae,  no  spontaneous  puromycin 
resistance  mutants  are  observed  in  M  maripaludis,  so  analysis  of  transformation 
frequency  is  more  direct. 

The  8,285-bp  cryptic  plasmid  pURBSOO,  which  was  isolated  from  M.  maripaludis 
strain  C5  (291),  has  formed  the  basis  for  development  of  an  expression  shuttle  vector  for 
use  in  this  organism.  The  vector,  pWLG30,  is  12.7-kbp  and  was  constructed  using  the 
integrative  expression  vector  pWLG14  as  well  as  a  large  portion  of  pURBSOO  (259).  This 
self-replicating  vector  is  stably  maintained  in  M.  maripaludis  and  supports  constitutive 
expression  of  heterologous  genes.  No  inducible  promoter  has  been  discovered  at  this 
time,  so  control  of  the  timing  and  level  of  expression  of  the  protein  of  interest  is  not 
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possible.  The  expression  vector  is  useful  for  producing  proteins  which  are  oxygen 
sensitive  and/or  may  require  additional  archaeal  factors  for  activity.  Unfortunately,  this 
vector  will  not  replicate  in  other  closely  related  Methanococcus  species,  including  M 
voltae  (259).  In  addition,  a  minimal  replication  region  has  not  been  determined  for  this 
plasmid;  thus,  large  portions  of  pURBSOO  are  required  for  replication  in  M  maripaludis. 
The  pURB500  vector  contains  18  ORPs,  most  of  which  seem  to  be  important  in 
maintaining  the  vector  in  the  methanogen  cell  (63,  259).  Recently,  a  smaller  vector, 
pWLG40,  has  been  constructed.  This  plasmid  is  10.2-kbp  and  is  maintained  in  E.  coli  and 
M.  maripaludis  at  a  higher  copy  number  than  pWLG30  (personal  communication, 
W.Gardner  and  W.  Whitman). 

In  conclusion,  developing  a  model  for  protein  folding  and  degradation  in  archaeal 
organisms  will  be  a  challenging  and  enlightening  task.  This  will  be  greatly  aided  by 
obtaining  the  complete  genome  sequences  of  some  archaeal  organisms  for  which  genetic 
systems  have  been  developed  such  as  H.  volcanii,  M.  maripaludis,  and  M  voltae.  In 
addition,  using  some  of  the  genetic  techniques  which  have  been  developed  for  use  in 
these  organisms  should  answer  some  of  the  questions  which  have  been  raised  in  this 
review  as  well  as  in  others. 


MATERIALS  AND  METHODS 
Materials 

Biochemicals  were  purchased  from  Sigma  Chemical  Co.  (St.  Louis,  Mo.).  Other 
organic  and  inorganic  chemicals  were  from  Fisher  Scientific  (Atlanta,  Ga.)  and  were 
analytical  grade.  Restriction  endonucleases  and  DNA  modifying  enzymes  were  from 
New  England  Biolabs  (Beverly,  Mass.)  or  Promega  (Madison,  Wis.).  Oligonucleotides 
were  from  Genemed  Synthesis  (San  Francisco,  Calif.)  or  from  Sigma-Genosys 
(Woodland,  Tex.).  Positively-charged  nylon  membranes,  digoxigenin-1 1-dUTP  (DIG), 
and  alkaline  phosphatase-conjugate  antibody  raised  against  DIG  were  from  Roche 
Biochemicals  (Indianapolis,  Ind.).  Polyvinylidene  difluoride  (PVDF)  and  positively- 
charged  nylon  membranes  were  from  MicroSeparations  (Westborough,  Mass.)  and 
Ambion  (Austin,  Tex.),  respectively.  The  A,BlueStar  vector  and  phage  packaging  extracts 
were  from  Novagen  (Madison,  Wis.).  Morph  site-specific  plasmid  DNA  mutagenesis  kit 
was  from  5  Prime-^  3  Prime  (Boulder,  Colo.).  Zero  Blunt-TOPO  PCR  cloning  kit  and 
the  pCR-TOPlO  vector  were  from  Invitrogen  (Carlsbad,  Calif.).  Goat  anti-rabbit  IgG 
alkaline  phosphatase-conjugate  was  from  Southern  Biotechnology  Associates 
(Birmingham,  Ala.).  The  Mta  and  Mtp  antibodies  raised  against  the  a  and  P  subunits, 
respectively,  of  the  M.  thermophila  20S  proteasome  were  described  previously  (156). 
Clones  containing  M  jannaschii  open  reading  frames  MJ0591,  MJ1237,  and  MJ0591 
were  obtained  from  The  Institute  for  Genomic  Research  (Rockville,  Md.). 
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Strains  and  Media 

Bacterial  and  archaeal  strains  used  in  this  study  are  listed  in  Table  3.  E.  coli 
strains  were  grown  in  Luria  Broth  (LB),  or  LB  supplemented  with  an  appropriate 
antibiotic,  including  34  mg  chloramphenicol,  50  mg  spectinomycin,  12.5  mg  tetracycline, 
100  mg  ampicillin,  and/or  50  mg  kanamycin  per  liter  unless  otherwise  indicated.  The 
ER1647  cells  used  as  a  host  strain  for  X  phage  were  grown  in  LB  supplemented  with 
tetracycline,  0.2%  maltose,  and  10  mM  MgSO4-7H20.  The  BM25.8  cells  used  for  Cre/lox- 
mediated  subcloning  of  plasmids  from  XBlueStar  phage  DNA  were  grown  in  LB 
supplemented  with  chloramphenicol,  kanamycin,  0.2%  maltose,  and  10  mM  MgS04.  The 
XL-1  Blue  MRF'  strain  containing  M.  maripaludis  shuttle  vectors  pWLG30  or  pWLG40 
was  grown  in  low-salt  LB  (0.5%  NaCl)  containing  60  mg  ampicillin  per  liter  according  to 
Gardner  and  Whitman  (63).  The  M.  jannaschii  strain  JAL-1  was  grown  to  mid-log  phase 
in  MSH  medium  under  a  80:20  H2:C02  gas  mix  at  40  Ib/in^  according  to  Boone  et  al. 
(20).  The  M.  maripaludis  strain  JJ  was  grown  in  either  MG  medium  (20),  McC,  or  McN 
medium  (257).  Liquid  cultures  were  maintained  in  80:20  H2:C02  gas  mix  at  40  Ib/in^  for 
liquid  cultures  and  at  1 5  Ib/in^  for  plates.  The  M.  maripaludis  strains  containing  the 
pWLG30  or  pWLG40  expression  vectors  were  grown  in  McC  media  containing  2.5  mg 
per  liter  puromycin. 

DNA  and  Transformation  Techniques 
The  concentration  of  genomic  and  plasmid  DNA  was  determined  by  measuring 
absorbance  at  260  nm  using  a  DU640  spectrophotometer  (Beckman,  Schaumburg,  111). 
Plasmids  from  E.  coli  and  M  maripaludis  strains  were  isolated  using  the  Quantum 
Plasmid  Mini-Prep  kit  (Bio-Rad,  Hercules,  Calif)  with  the  following  modifications  for 
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TABLE  3.  Bacterial  and  Archaeal  strains  used  in  this  study 


Strain 

Description  and/or  Genotype 

Source 

E.  co// BL21(DE3) 

¥'ompT\lon]hsdSR  CrRmR")  with  DEB  a  X 
prophage  carrying  the  11  RNA  polymerase 
gene (238) 

E.  coli  BM25.8 

supE  thi  A(lac-proAB)  [F'  traD36 
pro^^fi^/ac/'Z  AMI 5]  Ximm^^^kan'^)  PI 

Novagen 

E.  coli  DH5a 

F"  e«(i47  hsdRl  7  (rk"m  k"^)  supE44  thi-1  recAl 
gyrA  (Nal')  A-e/y47  A(lacIZYA-argF)U] 69 
deoR  (^S0dlacIA(lacZ)M15) 

Life 

Technologies 

E.  CO// ERl  647 

VfliuA2  MlacZ)rl  supE44  recDlON  trp'Sl 
mcrA1272::TnlO  (Tc^)  his-1  rpsL104  (str^) 
xyl7  mtl'2  metBl  A{mcrC-mrr)102:-JnlO 
(Tc'')hsdS{Ti,nmu2^) 

Novagen 

E.  coli  MORPH 
mutS 

A{mcrA)  A{mcrBC-hsdSMR-mrr)  endAl  supE 
thi  gyrA  relAl  lac  mutS'.-.lnlO  (Tet') 
[F'proAB  lacP  lacZ  AMI 5  Tn5] 

5  Prime->3 
Prime 

E.  colilB-l 

Vara  A{lac-proAB)  rpsL{StT^)  [(p80d/acA 
(lacZ)M\5]  thi  hsdR  (viCmK) 

New  England 
Biolabs 

E.  coliXh-X  Blue 
MRF' 

\(mrrA\l KtmrrRC  hv/i'^A/fP  mrrMJ'i 

endAl  supE44  thi-1  recAl  gyrA96relAl  lac 
[F'  proAB  lacf  ZAM15  TnlO  (Tet')] 

o  iiaiagene 

M  jannaschiiiALA 

Wild  type  (108) 

David  Boone, 
Portland  State 
Univ. 

M.  maripaludis  JJ 

Wild  type  (106) 

David  Boone, 
Portland  State 
Univ. 
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M.  maripaludis.  After  1  ml  of  a  mid-log  culture  was  centrifuged,  the  pellet  was 
resuspended  in  McC  medium  rather  than  the  resuspension  buffer  provided  by  the 
manufacturer  to  prevent  premature  lysis  of  the  cells.  The  E.  coli  strains  listed  in  Table  3 
were  transformed  with  plasmid  DNA  by  either  chemical  transformation  or 
electroporation.  The  M.  maripaludis  strain  JJ  was  transformed  using  techniques 
according  to  Tumbula  et  al.  (260)  with  modifications  as  per  W.  L.  Gardner  (62).  For  ail 
other  molecular  biology  procedures,  standard  techniques  were  employed  (206). 

Protein  Techniques  and  Enzyme  Assays 
Protein  concentrations  were  determined  by  the  bicinchoninic  acid  method  (232) 
(Pierce,  Rockford,  111.)  and/or  the  Coomassie  blue  dye-binding  method  (22)  (BioRad, 
Hercules,  Calif.)  using  bovine  serum  albumin  (BSA)  as  the  standard.  The  a-amino 
groups  generated  by  peptide-bond  hydrolysis  were  measured  by  fluorescence  at  pH  6.8 
using  fluorescamine  with  methionine  as  a  standard  (261).  Fluorescence  was  measured 
using  an  Aminco  Bowman  Series  2  Luminescence  Spectrometer  (Spectronic  Instruments, 
Rochester,  NY)  with  an  excitation  wavelength  of  370  nm  and  an  emission  wavelength  of 
480  nm.  Peptide-hydrolyzing  activity  was  assayed  by  fluorimetric  measurement  of  the 
release  of  7-amino-4-methylcoumarin  or  the  colorimetric  measurement  of  the  release  of 
P-naphthylamine  using  synthetic  peptide  substrates  as  previously  described  (156,  157). 
Nucleotidase  activity  was  assayed  by  measuring  the  release  of  inorganic  phosphate  (Pi) 
using  malachite  green  and  ammonium  molybdate  with  modifications  as  previously 
described  (132, 142).  For  analysis  of  various  nucleotide-hydrolyzing  activities,  enzyme 
(2  i^g  per  ml)  was  incubated  with  1  mM  nucleotide  in  25  mM  TES  at  pH  8.0  with  100 
mM  NaCl  and  10  mM  MgCh  (TES-NaCl-MgCb  buffer)  at  80°C  for  MjPAN  or  65°C  for 
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MjPAN(Al-73).  Specific  activities  are  reported  as  nanomoles  of  product  •  min"'  ■  mg"'  of 
protein.  For  determination  of  kinetic  parameters,  enzyme  (4  ng  per  ml)  was  incubated 
with  ATP  or  CTP  at  0.05,  0.1,  0.25,  0.5,  and  1  mM  using  similar  assay  conditions.  Initial 
velocity  was  determined  by  measuring  the  release  of  Pj  in  triplicate  using  four  time  points 
from  0  to  6  min.  Initial  velocity  was  plotted  as  a  function  of  substrate  concentration  by 
the  Lineweaver-Burk  method  (39).  Similar  results  were  obtained  using  Michaelis-Menten 
and  Eadie-Hofstee  plots  (39).  The  effects  of  nucleotide  diphosphates  were  measured  by 
preincubating  the  enzyme  (2  ^g  per  ml)  with  or  without  1  mM  CDP  or  ADP  for  1 5  min 
in  TES-NaCl-MgCb  buffer  at  2 1  °C  before  addition  of  1  mM  ATP  or  CTP  and  then 
assayed  as  described  above.  The  pH  optimum  of  enzymes  was  determined  using  the 
following  buffers  at  25  mM:  MES  (pH  5  to  7),  TES  (pH  7  to  8),  Tricine  (pH  8  to  8.8), 
andCAPSO  (pH8.9to  10). 

Molecular  Mass  Determinations 
Molecular  masses  of  the  purified  PAN  and  20S  proteasome  proteins  were 
determined  by  reducing  and  denaturing  sodium  dodecyl  sulfate  (SDS)-polyacrylamide 
gel  electrophoresis  (PAGE)  using  12%  polyacrylamide  gels  (141)  which  were  stained 
with  Coomassie  blue  R-250.  The  molecular  weight  standards  for  SDS-PAGE  were 
phosphorylase  b  (97.4  kDa),  serum  albumin  (66.2  kDa),  ovalbumin  (45  kDa),  carbonic 
anhydrase  (31  kDa),  trypsin  inhibitor  (21.5  kDa),  and  lysozyme  (14.4  kDa)  (BioRad). 
Native  molecular  masses  of  the  complexes  were  determined  by  applying  purified  proteins 
to  a  Superose  6  HR  10/30  column.  For  the  PAN  protein  from  M.  jannaschii,  the  column 
was  equilibrated  in  20  mM  Tris  buffer  containing  150  mM  NaCl  at  pH  8.0.  The 
molecular  mass  standards  were  applied  to  the  same  column  equilibrated  in  50  mM  Tris- 
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HCl  buffer  with  150  mM  NaCl  and  1  mM  DTT  at  pH  7.2.  These  standards  included: 
serum  albumin  (66  kDa),  alcohol  dehydrogenase  (150  kDa),  P-amylase  (200  kDa), 
apoferritin  (443  kDa),  and  thyroglobulin  (669  kDa). 

Cloning  the  M.  jannaschii  Proteasome  Genes 
Plasmid  pAMJHW03  (26)  containing  the  open  reading  frame  MJl  176  encoding 
the  putative  PAN  protein  was  obtained  from  the  American  Type  Culture  Collection 
(ATCC)  (Rockville,  MD).  Polymerase  chain  reaction  (PCR)  was  used  to  amplify  DNA 
fragments  carrying  the  M  jannaschii  pan  ORF  and  contained  10  mM  Tris-HCl  buffer  at 
pH  8.8  containing  10  mM  KCl,  10  mM  (NH4)2S04, 2  to  6  mM  MgS04,  0.1%  Triton  X- 
100, 250  |xM  dNTPs,  1 .5  |iM  each  primer  and  2,500  ng  double  stranded  plasmid  DNA 
template,  and  0.02  U/|il  Vent  (exo^)  DNA  polymerase  with  a  Gene  Cycler  (BioRad). 
Oligonucleotide  primers  and  template  plasmid  DNA  are  presented  in  Table  4.  The  DNA 
fragments  generated  by  PCR  amplification  were  cloned  into  expression  vectors  by  either: 
1)  phosphorylating  the  PCR  fragment  with  T4  polynucleotide  kinase,  ligating  the 
fragment  into  the  Hindi  site  of  pUC19,  and  then  ligating  the  pUC19  cloned  fragment 
into  pET24b  using  Ndel  and  EcoBJ  or  2)  directly  ligating  the  PCR  fragment  into  pET24b 
or  pET15b  using  Ndel,  EcoBJ,  orXhol  (Table  4).  The  DNA  sequences  of  the  fragments 
generated  by  PCR  amplification  were  verified  by  DNA  sequence  analysis  of  both  strands 
of  plasmids  pMjPANS,  pMjPAN6,  pMjPAN8,  pJAM735,  and  pJAM736. 
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TABLE  4.  Plasmids  used  in  this  study. 


Plasmid  (stock  #) 


Phenotype  or  genotype;  oligonucleotides  for  PCR 
amplification  or  site-directed  mutagenesis 


Source 


pCR-TOPO 

pET24b 

pET15b 

pWLG40  +  lacZ 

pSJS1240 

pAMJL73 


pAMJKH57 

pAMJHW03 

pMjPANl  (#340) 

pMjPAN2  (#447) 
pMjPAN3  (#487) 


pMjPANS  (#493) 


pMjPAN6  (#505) 


Km  ,  cloning  vector  Invitrogen 

Km*^;  expression  vector  Novagen 

Ap*^;  expression  vector  Novagen 

Ap"^;  Pur"^;  expression  shuttle  vector  (63) 

Sp'^;  E.  coli  ileX  and  argU  (1 24) 

Ap"^;  2.385-kb  of  M  jannaschii  genomic  DNA  ATCC 
bases  524549  to  526933  of  L77117  in  pUClS; 
carries  complete  ORE  MJ0591  (a  subunit  of  20S 
proteasome) 

Ap*^;  2.036-kb  of  M  jannaschii  genomic  DNA  ATCC 
bases  11 82020  to  11 84055  of  L771 17  in  pUCl 8; 
carries  complete  ORE  MJ1237  (p  subunit  of  20S 
proteasome) 

Ap  ;  2. 1 57-kb  of  M  jannaschii  genomic  DNA  ATCC 
bases  1114515  to  1 1 16671of  L771 17  in  pUC18; 
carries  complete  ORE  MJl  176  (proteasome- 
activating  nucleotidase) 

Ap"^;  Nde\  to  E'coRV  1 .6-kb  of  M  jannaschii  this  study 

genomic  DNA  ligated  into  the  HincW  site  of 
pUC19;  carries  84-bp  upstream  and  191 -bp 
downstream  of  the  complete  MJl  176 

Km"^;  1 .6-kb  Ecom-Xo-Hindm  fragment  of  this  study 

pMjPANl  ligated  into  pET24b;  produces  PAN  and 
PAN(A  1-73)  proteins 

Ap"^;  1 ,3 1 2-bp  fragment  generated  by  PCR  this  study 

amplification  using  pAMJHW03;  carries  the 
complete  MJl  176  ligated  into  pUC19  using  Hindi; 
5'-ccgccca/fl(ggtttttgaagaatt-3'  and  5'- 
gcggcgaa^ttatctgtagagaa-3'  {Ndel  and  ^coRI  sites 
indicated) 

Ap"^;  1 302-bp  Ndel-Xo-Ecom  fragment  of  this  study 

pMjPAN3  ligated  into  pETlSb;  produces  full- 
length  PAN  protein  with  His6  at  N  terminus  and  39- 
kDa  PAN(Al-73) 

R  • 

Ap  ;  site-directed  mutation  of  pMjPAN3  which         this  study 
disrupts  the  internal  Shine  Dalgamo  site;  5'- 
gagcttgaacg/gaflaatttacagttaatg-3';  altered  bases 
indicated  in  italics 
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TABLE  4.  Cont. 


Plasmid  (stock  #) 


Phenotype  or  genotype;  oligonucleotides  for  PCR 
amplification  or  site-directed  mutagenesis 


Source 


pMjPAN?  (#511) 


pMjPAN8  (#528) 


pJAM721  (#617) 


pJAM727  (#648) 


pJAM728  (#650) 


pJAM730  (#655) 


Km  ;  1302-bp  Ndel-to-EcoRl  fragment  of  this  study 

pMjPAN6  which  carries  the  complete  MJl  176  with 
the  internal  translation  mutation  ligated  into 
pET24b;  produces  PAN  of  47  kDa 

Km'^;  1083 -bp  Ndel-to-EcoRl  fragment  generated      this  study 
by  PCR  amplification  using  pAMJHW03 ;  carries 
MJl  1 76  with  a  2 1 9-bp  5'-deletion  (N-terminal  73- 
residue  truncation)  ligated  into  pET24b;  produces 
truncated  PAN(Al-73)  only;  5'- 
tccatofgaaagaaaatgagattttaa-3'  and  5'- 
gcggcgaaa^ttatctgtagagaa-3';  Ndel  and  EcoRl 
sites  indicated 

Ap*^;  contains  ~1 5-kbp  insert  of  genomic  DNA  this  study 

from  M.  maripaludis  obtained  from  automatic  Cre- 
lox  mediated  subcloning  of  A,BlueStar  vector; 
carries  entire  1220-bp  Mmpan  gene  and  upstream 
and  downstream  regions 

Km"^;  1220-bp  Ndel-Xo-BamWl  fragment  generated     this  study 
by  PCR  amplification  using  pJAM721  as  a 
template;  carries  entire  Mmpan  gene  ligated  into 
pET24b;  produces  MmPAN  protein  of  45-kDa;  5'- 
gacgcaacflto/gagttacccagatg-3 '  and  5'- 
atagtggfl/ccttatccgtacattac-3';  Ndel  and  BamWl  sites 
indicated 

Ap"^;  1220-bp  Nde\-Xo-Bam\{\  fragment  generated      this  study 
by  PCR  amplification  using  pJAM721  as  a 
template;  carries  entire  Mmpan  gene  ligated  into 
pET15b;  produces  45-kDa  MmPAN  protein  with 
His6tag  at  N-terminus;  5'- 
gacgcaacato/gagttacccagatg-3'  and  5'- 
atagtggafccttatccgtacattac-3';  Ndel  and  BamUl  sites 
indicated 

Km  ;  1220-bp  Ndel-\.o-Xhol  fragment  generated  by    this  study 
PCR  amplification  using  pJAM721  as  a  template; 
carries  entire  Mmpan  gene  ligated  into  pET24b; 
produces  45-kDa  MmPAN  protein  with  Hisetag  at 
C-terminus;  5'-gacgcaacflto<'gagttacccagatg-3 '  and 
5'-ctagtagtcfcgagtccgtacattac-3 ' 
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TABLE  4.  Cont. 


Plasmid  (stock  #) 


Phenotype  or  genotype;  oligonucleotides  for  PCR  Source 
amplification  or  site-directed  mutagenesis 

Km*^;  Ndel-to-Xhol  fi-agment  generated  by  PCR        this  study 
amplification  using  pMjPANl  as  a  template;  carries 
entire  unmodified  Mjpan  gene  ligated  into  pET24b; 
produces  47-kDa  MjPAN  protein  with  Hisetag  at 
C-terminus;  S'-ccgcgco/o/ggtttttgaagaatt-S'  and 
5'tagtgatcfcgflgtctgtagagaac-3';  Ndel  and  Xhol  sites 
indicated 

Km"^;  Ndel-to-Xhol  fi-agment  generated  by  PCR        this  study 

amplification  using  pMjPAN?  as  a  template;  carries 

entire  Mjpan  gene  with  the  internal  translation 

mutation  ligated  into  pET24b;  produces  47-kDa 

MjPAN  protein  with  Hise  tag  at  C-terminus;  same 

primers  used  as  for  construction  of  pJAM735 

Ap^;Pm^,Pstl-to-Bg[ll  fragment  generated  by  PCR     this  study 
using  pJAM727  as  a  template;  carries  entire 
Mmpan  gene  (plus  the  T7  terminator  sequence  from 
pET24b)  ligated  into  pWLG40;  produces  45-kDa 
MmPAN  protein  in  recombinant  M  maripaludis; 
5  '-ggagtacfgcflgatgagttacccag-3 '  and  5'- 
catgaagfl/c/gcttaatgcgccgc-3';  Pstl  and  BgRl  sites 
indicated 

Ap'^;Pur'^;  Pstl-Xo-BgUl  fragment  generated  by  PCR    this  study 
using  pJAM730  as  a  template;  carries  entire 
Mmpan  gene  (plus  the  T7  terminator  sequence  from 
pET24b)  ligated  into  pWLG40;  produces  45-kDa 
MmPAN  protein  with  His6  tag  at  C-terminus  in 
recombinant  M.  maripaludis;  same  primers  used  as 
for  construction  of  pJAM737 

Ap'^;Pur'^;  Pstl-Xo-BgUl  fragment  generated  by  PCR    this  study 
using  pJAM728  as  a  template;  carries  entire 
Mmpan  gene  (plus  the  T7-  terminator  sequence 
from  pET24b)  ligated  into  pWLG40;  produces  45- 
kDa  MmPAN  protein  with  Hise  tag  at  N-terminus 
in  recombinant  M.  maripaludis;  5'- 
aaacfgcagatgggcagcagccatc-3 '  and  5'- 
acatgaflga/c/cgggcctcttgcg-3';  Pstl  and  BgRl  sites 
indicated 


pJAM735  (#747) 


pJAM736  (#749) 


pJAM737  (#769) 


pJAM738  (#754) 


pJAM739  (#830) 
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TABLE  4.  Cont. 


Plasmid  (stock  #) 


Phenotype  or  genotype;  oligonucleotides  for  PCR 
amplification  or  site-directed  mutagenesis 


Source 


Ap'^;Pur'^;  Nsil-to-BgUl  fragment  generated  by  PCR   this  study 
using  pJAM735  as  a  template;  carries  entire 
unmodified  Mjpan  gene  (plus  the  T7  -terminator 
sequence  from  pET24b)  ligated  into  pWLG40; 
produces  47-kDa  MjPAN  protein  with  His6  tag  at 
C-terminus  in  recombinant  M  maripaludis;  5'- 
gcggfl/gca/ggtttttgaagaatt-3'  and  5'- 
catga£gflfc/gcttaatgcgccgc-3';  Nsi\  and  Bghl  sites 
indicated 

Ap'^;Pur'^;  Nsil-Xo-BgR\  fragment  generated  by  PCR    this  study 

using  pJAM736  as  a  template;  carries  entire  M]pan 

gene  with  the  internal  translation  mutation  (plus  the 

T7  -terminator  sequence  from  pET24b)  ligated  into 

pWLG40;  produces  47-kDa  MjPAN  protein  with 

Hise  tag  at  C-terminus  in  recombinant  M. 

maripaludis;  same  primers  used  as  for  pJAM740 

Ap'^;Pur'^;  Nsil-Xo-Bglil  fragment  generated  by  PCR    this  study 

using  pMjPAN7  as  a  template;  carries  entire  M]pan 

gene  with  the  internal  translation  mutation  (plus  the 

T7  -terminator  sequence  from  pET24b)  ligated  into 

pWLG40;  produces  47-kDa  MjPAN  protein  in 

recombinant  M.  maripaludis;  same  primers  used  as 

for  pJAM740 

Ap'^;Pur'^;  Nsil-Xo-Bglil  fragment  generated  by  PCR    this  study 
using  pMjPAN5  as  a  template;  carries  entire 
unmodified  yi]pan  gene  (plus  the  T7  -terminator 
sequence  from  pET24b)  ligated  into  pWLG40; 
produces  47-kDa  MjPAN  protein  with  Hise  tag  at 
N-terminus  in  recombinant  M.  maripaludis;  5'- 
agataa/gcargggcagcaRccatc-3 '  and  5'- 
acatgaagfl/Wcgggcctcttgcg-3';  Nsil  and  BgRl  sites 
indicated 

Km"^;  750-bp  fragment  generated  by  PCR  this  study 

amplification  using  M  maripaludis  genomic  DNA 
as  a  template;  carries  333-bp  upstream  and  417-bp 
of  the  5'  region  ofMmpan  ligated  into  pCR-TOPO; 
5'-cctcttttccgcagagttcgcattg-3'  and 
5  'gatttccattgccatagctcggtg-3 ' 


pJAM740  (#771) 


pJAM741  (#774) 


pJAM742  (#826) 


pJAM743  (#828) 


pJAM745  (#800) 
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Site-directed  Mutagenesis 

Oligonucleotide-directed  mutagenesis  was  performed  using  the  MORPH  site- 
specific  plasmid  DNA  mutagenesis  kit  as  recommended  by  the  suppUer  (5  Prime->  3 
Prime).  Double-stranded  plasmid  pMjPANl  was  used  as  a  template  for  annealing  of  the 
oligonucleotide  5'-GAGCTTGAACG7;GA^AATTTACAGTTAATG-3',  where  the  bold 
italicized  bases  are  the  conservative  codon  replacements  used  to  alter  the  internal 
ribosomal  binding  site  of  ORF  MJl  176.  Plasmid  DNA  was  treated  with  Dpnl  to  destroy 
double-stranded  methylated  non-mutagenized  plasmids.  Mutagenized  plasmids  were 
transformed  into  E.  coli  strain  MORPH  mutS  (Table  3),  and  mutations  were  confirmed  by 
DNA  sequence  analysis  of  plasmid  pMjPAN6  (Table  4). 

Expression  of  the  M.  jannaschii  and  M.  maripaludis  pan  Genes  in  E.  coli 

The  M.  jannaschii  and  M.  maripaludis  pan  genes  which  encode  the  PAN  protein 
were  expressed  in  E.  coli  BL21(DE3)  using  the  bacteriophage  T?  RNA  polymerase- 
promoter  system  with  plasmids  listed  in  Table  4  (Novagen,  Madison,  WI)  (238).  An  E. 
coli  host  strain  containing  the  //eJf  and  argU  genes,  encoding  tRNAAUAand 
tRNAAGA/AGG,  on  plasmid  pSJS1240  (123),  was  transformed  with  appropriate  expression 
plasmids  to  maximize  the  yield  of  recombinant  M  jannaschii  and  M.  maripaludis 
proteins.  Transformed  cells  were  inoculated  into  LB  medium  supplemented  with 
appropriate  antibiotics  and  grown  at  37°C  and  200  RPM  until  the  cells  reached  mid-log 
phase  (Aaoonm  ~  0.7  units).  The  T7  RNA  polymerase-dependent  transcription  was  induced 
with  0.4  mM  IPTG  for  2  h.  Cells  were  harvested  by  centrifugation  at  5,000  x  g  for  1 5  min 
at  4°C,  and  the  cell  material  was  either  used  immediately  or  stored  at  -70°C. 
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Purification  of  MjPAN  Proteins  from  Recombinant  E.  coli 
For  purification  of  N-terminal  poly-histidine-tagged  MjPAN  (Hise-MjPAN),  E. 
coli  (pMjPANS)  cells  were  thawed  in  6  volumes  (wt/vol)  of  20  mM  Tris  buffer  at  pH  7.9 
containing  5  mM  imidazole  and  passed  through  a  French  pressure  cell  at  20,000  lb/in  . 
This  was  followed  by  centrifligation  at  16,000  x  g  for  20  min  at  4°C.  The  extract  was 
applied  to  a  Ni^"^-Sepharose  column  (4.8  by  0.8  cm)  (Pharmacia)  equilibrated  with  20 
mM  Tris  buffer  at  pH  7.9  containing  5  mM  imidazole  and  then  washed  with  10  ml  of  20 
mM  Tris  buffer  at  pH  7.9  containing  60  mM  imidazole.  The  His6-MjPAN  was  eluted 
with  10  ml  of  20  mM  Tris  buffer  at  pH  7.9  containing  500  mM  imidazole,  pooled,  and 
dialyzed  against  25  mM  Tris-pH  7.5  buffer.  Fractions  with  ATP  hydrolyzing  activity  at 
60°C  were  pooled  and  determined  to  be  pure  by  reducing  SDS-PAGE  (141). 

For  purification  of  MjPAN  proteins  without  the  His6-tag,  cell-free  lysate  was 
generated  using  E.  coli  (pMjPAN2,  pMJPAN7,  or  pMJPAN8)  cells  as  described  above 
with  20  mM  Tris  buffer  at  pH  8.0  containing  1  mM  DTT  (Tris-pH  8.0-buffer)  and  passed 
through  a  French  pressure  cell  at  20,000  Ib/in^.  The  lysate  was  heated  to  85°C  for  15  min, 
chilled  to  0°C  for  15  min,  and  centrifiiged  at  10,000  x  g  for  30  min  at  4°C  which  removed 
the  majority  of  E.  coli  proteins.  Samples  were  applied  to  a  HiLoad  Q-Sepharose  26/10 
column  (2.5  by  28.5  cm)  (Pharmacia)  equilibrated  in  Tris-pH  8.0-buffer,  and  the  proteins 
were  eluted  with  a  linear  NaCl  gradient  (0  to  500  mM  NaCl  in  240  ml  of  Tris-pH  8.0- 
buffer)  at  375  mM  NaCl.  Fractions  which  hydrolyzed  ATP  at  65°C  were  pooled, 
concentrated  by  dialysis  against  PEG8000,  and  applied  to  a  Superose  6  HR  10/30  column 
equilibrated  with  Tris-pH  8.0-buffer  with  150  mM  NaCl  and  10  %  glycerol.  Protein 
fractions  with  ATPase  activity  were  pooled  and  applied  to  a  QHyperD  (BioRad)  or 
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MonoQ  HR  5/5  column  (Pharmacia)  equilibrated  with  Tris-pH  8.0-buffer  with  100  mM 
NaCl.  The  proteins  were  eluted  with  a  linear  NaCl  gradient  (100  to  500  mM  NaCl  in  10 
ml  of  Tris-pH  8.0-buffer)  at  370  mM  NaCl.  Fractions  which  hydrolyzed  ATP  at  65EC 
were  pooled.  Full-length  MjPAN  protein  was  further  purified  by  application  to  a  CHT5- 
IB  hydroxyapatite  column  (BioRad)  equilibrated  with  10  mM  sodium  phosphate  buffer  at 
pH  7.5  containing  1  mM  DTT  and  10%  glycerol  (sodium  phosphate-pH  7.5  buffer)  and 
elution  with  a  linear  gradient  (10  to  250  mM  sodium  phosphate-pH  7.5  buffer  in  30  ml). 
The  MjPAN  protein,  which  eluted  at  225  mM  sodium  phosphate-pH  7.5  buffer,  was 
dialyzed  against  50  mM  Tris  buffer  at  pH  7.5  containing  100  mM  NaCl,  1  mM  DTT,  and 
1 0%  glycerol  at  4°C  for  1 8  h.  The  protein  was  then  applied  to  a  Sephadex  G-25  (Sigma) 
column  equilibrated  with  50  mM  Tris  buffer  at  pH  7.5  containing  100  mM  NaCl,  1  mM  . 
DTT,  and  10%  glycerol  to  remove  all  detectable  Pi.  The  MjPAN  and  MjPAN(Al-73) 
proteins  were  determined  to  be  pure  by  reducing  SDS-PAGE  (141)  and  stored  under 
liquid  N2  without  apparent  loss  of  activity. 

Purification  of  Genomic  DNA  from  M.  maripaludis 

A  100  ml  late  log  culture  of  M  maripaludis  in  MG  medium  was  centrifiaged  at 
5,000  X  g  for  10  minutes  at  4°  C.  The  cell  pellet  was  then  resuspended  in  5  ml  TE/sahs 
buffer  (10  mM  Tris,  100  mM  EDTA,  5  mM  MgCb,  and  50  mM  NaCl,  pH  8.0)  containing 
the  same  concentration  of  salt  as  in  the  growth  media.  Genomic  DNA  was  then  isolated 
following  procedures  according  to  K.  R.  Sowers  (235). 

Southern  Analysis  of  M.  maripaludis  Genomic  DNA 

Genomic  DNA  from  M.  maripaludis  was  digested  with  several  restriction 
enzymes,  including  £coRV,  Xbal,  and  Smal,  separated  by  0.8%  agarose  gel 
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electrophoresis,  and  transferred  for  16  h  by  upward  capillary  action  to  a  positively- 
charged  nylon  membrane  (234).  After  UV-crosslinking,  the  membranes  were  equilibrated 
in  hybridization  buffer  (5X  SSC,  0.1%  N-lauroylsarcosine,  0.02%  SDS,  1%  blocking 
reagent)  for  4h  at  68°C,  where  20X  SSC  solution  consists  of  3M  NaCl  and  0.3M  sodium 
citrate  at  pH  7.0.  The  blot  was  then  probed  with  the  1071 -bp  M  jannaschii  pan  (A  1-73) 
gene  fragment  (Mj/>a«  probe),  which  had  been  random-prime  labeled  with  DIG-dUTP 
following  procedures  recommended  by  the  manufacturer  (Roche  Biochemicals). 
Hybridization  was  performed  with  the  M^pan  probe  at  a  final  concentration  of  20  ng  per 
ml  for  14  h  at  68°  C.  The  membrane  was  then  washed  in  2X  SSC  buffer  containing  0.1% 
SDS  (2x5  min),  and  in  0.5X  SSC  buffer  containing  0.1%  SDS  (2x15  min)  at  68°  C. 
Signals  were  then  visualized  colorimetrically  using  alkaline  phosphatase-conjugate 
antibody  raised  against  DIG  according  to  instructions  provided  by  the  supplier  (Roche 
Biochemicals). 

Construction  of  a  M  maripaludis  Random  Genomic  X  Librziry 
A  random  genomic  X  library  was  constructed  using  M  maripaludis  genomic 
DNA  which  had  been  partially  digested  with  the  endonuclease  5'aw3AI.  The  procedures 
were  as  recommended  by  the  manufacturer  (Novagen)  with  the  following  modifications. 
After  partial  fill-in  of  the  5'flM3  Al-digested  cohesive  ends,  the  DNA  was  loaded  onto  a 
0.8%  agarose  gel  and  size  fractionated  to  isolate  7-  to  20-kbp  fragments  for  packaging 
into  A,BlueStar  phage.  After  excision  of  the  agarose  gel  piece  containing  fragments  of  the 
desired  size,  the  DNA  was  electroeluted  according  to  Peloquin  and  Platzer  (187).  The 
DNA  fragments  were  then  ligated  into  ^-BlueStar  Xho\-d\ges\.ed  vector  arms,  added  to 
X  phage  packaging  extract,  and  plated  onto  the  host  strain  ER1647  (Table  3).  After 


amplification  of  the  library  according  to  the  procedures  of  the  manufacturer  (Novagen), 
the  X  library  was  stored  in  SM  buffer  (50  mM  Tris  at  pH  7.5  containing  100  mM  NaCl,  8 
mM  MgS04,  and  0.01%  gelatin)  in  0.5  ml  aliquots  containing  7%  DMSO  at  -70°  C. 
Isolation  and  Cloning  of  the  Mmpan  Gene  from  M  maripaludis 
After  plating  the  phage  from  the  M  maripaludis  genomic  A,  library,  the  phage 
were  grown  for  14  h  at  37°  C  on  LB  and  then  plates  were  chilled  at  4°C  for  1  h.  Plaques 
were  then  lifted  onto  nylon  membranes,  treated  with  denaturation  solution  (1.5M  NaCl, 
0.5M  NaOH)  for  5  min,  and  neutralized  with  1.5M  NaCl  and  IM  Tris  (pH  7.5)  for  15 
min.  After  treatment  with  2X  SSC  for  15  min,  the  DNA  was  fixed  to  the  wet  membranes 
by  UV-crosslinking.  The  membranes  were  then  probed  with  the  M]pan  probe,  using  the 
hybridization  conditions  and  detection  methods  described  above.  Plasmid  subclones  of 
positive  plaques  were  obtained  by  automatic  Cre-lox-mediated  subcloning  using  the  host 
strain  BM25.8  (Table  3).  The  BM25.8  strain  expresses  the  Cre  recombinase  which  will 
excise  the  X,  vector  at  two  loxP  sites  by  site-specific  recombination.  This  yields  a  pUC- 
derived  plasmid  containing  the  genomic  insert  of  interest,  a  gene  conferring 
ampicillin/carbenicillin  resistance,  and  an  E.  coli  origin  of  replication.  Following  the 
procedures  recommended  by  the  manufacturer  (Novagen),  the  X  phage  was  adsorbed  to 
host  strain  BM25.8  and  then  the  host/phage  mixture  was  plated  onto  LB  agar  plates 
containing  50  mg  per  liter  carbenicillin.  Plasmid  replication  and  copy  number  in  strain 
BM25.8  is  very  low;  therefore,  selected  plasmid  subclones  were  isolated  and  re- 
transfomed  by  electroporation  into  the  E.  coli  host  strain  DH5a.  After  isolation  of  the 
plasmid  DNA  from  these  clones.  Southern  analysis  was  performed  using  identical 
conditions  as  described  above  for  analysis  of  M.  maripaludis  genomic  DNA. 
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Cloning  of  the  Mmpan  Gene  into  E.  coli  Expression  Vectors 
Plasmid  pJAM721  was  used  as  a  template  in  polymerase  chain  reactions  to 
amplify  DNA  fragments  carrying  the  M  maripaludis  ORF.  The  PCR  buffer  conditions 
were  as  described  for  M]pan  except  that  55  ng  of  pJAM721  was  used  as  a  template. 
Cycling  conditions  were  1  min  at  94°,  2  min  at  50°C,  and  2  min  at  72°  C  (for  40  cycles). 
The  DNA  fragments  generated  by  PCR  amplification  were  cloned  into  the  expression 
vectors  pETl  5b  (for  N-terminal  His6-tagged  construct)  or  pET24b  (for  native  and  C- 
terminal  Hise-tagged  constructs)  using  Ndel  and  BamWl  ox  Xhol  (Table  4).  The  DNA 
sequences  of  the  fragments  generated  by  PCR  amplification  were  verified  by  DNA 
sequence  analysis  of  both  strands  of  plasmids  pJAM727,  pJAM728,  and  pJAM730. 

To  verify  the  start  site  of  the  Mmpan  gene,  the  5'  region  was  amplified  directly 
from  genomic  DNA  to  construct  plasmid  pJAM745.  The  PCR  buffer  conditions  were  the 
same  as  those  described  above  except  that  300  ng  of  genomic  DNA  was  used  as  a 
template,  and  the  annealing  temperature  was  55°C.  After  PCR,  the  fragments  were 
directly  cloned,  via  a  topoisomerase  1-mediated  direct  ligation,  into  the  pCR-TOPO 
vector  using  procedures  as  recommended  by  the  supplier  (Invitrogen).  The  start  site  of 
the  Mmpan  ORF  was  then  verified  by  DNA  sequence  analysis  of  both  strands  of 
pJAM745  (Table  4). 

Purification  of  MmPAN  Proteins  from  Recombinant  E.  coli 
For  purification  of  N-  and  C-terminal  poly-histidine-tagged  MmPAN  (N-  and  C- 
His6-MmPAN)  proteins,  E.  coli  (pJAM728  and  pJAM730)  cells  were  thawed  in  8  and  6.5 
volumes  (wt/vol),  respectively,  of  20  mM  Tris  buffer  at  pH  7.5  containing  100  mM  NaCl 
and  5  mM  imidazole  and  passed  through  a  French  pressure  cell  twice  at  20,000  Ib/in^. 
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This  was  followed  by  centrifligation  at  15,000  x  g  for  20  min  at  4°C.  All  samples  and 
buffers  were  filtered  through  a  0.45|im  filter.  Samples  (48.4  mg  total  for  N-His6-MmPAN 
and  74  mg  total  for  C-His6-MmPAN)  were  applied  to  Ni^"^-Sepharose  columns  (4.8  by 
0.8  cm)  (Pharmacia)  which  had  been  equilibrated  with  20  mM  Tris  buffer  at  pH  7.5 
containing  100  mM  NaCl  and  5  mM  imidazole  and  then  washed  with  10  ml  of  20  mM 
Tris  buffer  at  pH  7.5  containing  100  mM  NaCl  and  60  mM  imidazole.  The  His6-MmPAN 
proteins  were  eluted  with  10  ml  of  20  mM  Tris  buffer  at  pH  7.5  containing  100  mM  NaCl 
and  500  mM  imidazole  and  pooled.  For  the  C-His6-MmPAN  construct  only,  0.63  mg  of 
the  protein  was  then  applied  to  a  Superose  6  size  exclusion  column  equilibrated  with  25 
mM  Tris  buffer  at  pH  7.5  containing  150  mM  NaCl,  1  mM  DTT,  and  10%  glycerol.  For 
purification  of  MmPAN  proteins  without  the  His6-tag,  E.  coli  (pJAM727)  cells  were 
thawed  in  7  volumes  (wt/vol)  of  20  mM  Tris  buffer  at  pH  7.5  containing  150  mM  NaCl, 
1  mM  DTT,  and  10%  glycerol  (resuspension  buffer)  and  passed  through  a  French 
pressure  cell  twice  at  20,000  lb/in  .  This  was  followed  by  centrifligation  at  16,000  x  g  for 
20  min  at  4EC.  Cell-free  lysate  (5  mg)  was  then  applied  to  a  Superose  6  HR  10/30 
column  which  had  been  equilibrated  in  resuspension  buffer.  Fractions  (300-  to  600-kDa) 
which  hydrolyzed  ATP  at  37°C  were  then  pooled  and  applied  to  a  MonoQ  HR  5/5 
column  (Pharmacia)  which  had  been  equilibrated  in  25  mM  Tris  buffer  at  pH  7.5 
containing  150  mM  NaCl,  1  mM  DTT,  and  10%  glycerol.  The  proteins  were  eluted  with 
a  linear  NaCl  gradient  from  150  mM  to  1  M  NaCl  in  20  ml  of  buffer.  In  all  purification 
steps,  the  purity  of  protein  was  measured  by  reducing  SDS-PAGE. 
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Cloning  of  the  Mmpan  and  M\pan  Genes  into  pWLG40  Expression  Vectors 

Plasmids  pJAM727,  pJAM728,  and  pJAM730  were  used  as  templates  for  PGR 

amplification  of  the  Mmpan  ORF  as  well  as  the  downstream  Ty-terminator  sequences 

contained  on  the  pETlSb  and  pET24b  vectors.  The  PCR-generated  fragments  were 

directly  cloned,  via  a  topoisomerase  I-mediated  ligation,  into  pCR-TOPO  vectors  using 

procedures  recommended  by  the  supplier  (Invitrogen).  After  DNA  sequence  analysis, 

Pstl-to-Bglll  fragments  from  the  pCR-TOPO  vectors  were  cloned  into  Nsil-Bglll  cut 

pWLG40  vectors  to  construct  the  plasmids  pJAM737,  pJAM738,  and  pJAM739  (Table 

4).  Since  the  Mmpan  gene  contains  an  Nsil  site,  the  compatible  restriction  enzyme  Pstl 

was  used  rather  than  Nsil  for  cloning  of  the  Mmpan  gene  into  pWLG40.  The  start  codon 

was  thus  8  bp,  rather  than  5  bp,  away  from  the  Shine-Dalgamo  sequence  in  the  cloning 

region  of  the  pWLG40  vector. 

Identical  procedures  to  those  described  above  were  used  to  clone  the  Mjpan  ORF 

into  the  pWLG40  vector,  with  the  following  differences.  The  pMjPANS,  pMjPAN7, 

pJAM735,  and  pJAM736  plasmids  were  used  as  templates  in  PGR  reactions.  The  Nsil-to- 

Bglll  fragments  were  then  cloned  into  the  pWLG40  vector  to  generate  the  expression 

plasmids  pJAM740,  pJAM741,  pJAM742,  and  pJAM743  (Table  4). 

Synthesis  of  MmPAN  and  MjPAN  Proteins  in  Recombinant  M  maripaludis  and 
Purification  of  C-terminal  Hisg-tagged  MmPAN 

M  maripaludis  strains  containing  pJAM737,  pJAM738,  pJAM739,  pJAM740, 

pJAM741,  pJAM742,  and  pJAM743  expression  plasmids  (Table  4)  were  grown  in  McC 

medium  containing  2.5  mg  per  liter  puromycin.  Expression  of  MmPAN  and  MjPAN 

proteins  was  monitored  by  SDS-PAGE  and  immunoblotting  using  anti-MjPAN 

antibodies.  The  purification  of  G-terminal  His6-tagged  MmPAN  was  performed  at  20°G 
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to  25°C  in  an  anaerobic  chamber  (Coy  Laboratory  Products,  Ann  Arbor,  Mich.) 
containing  N2:H2  (19:1).  All  solutions  were  filtered  through  a  0.45nni  filter  and  made 
anoxic  as  previously  described  (249).  The  M.  maripaludis  cells  harboring  pJAM738 
(Table  4)  and  pWLG40  +  lacZ  (as  a  control)  were  centrifiiged  anaerobically  at  2,500  x  g 
for  15  min  and  washed  two  times  in  20  mM  Tris  buffer  at  pH  7.4  containing  375  mM 
NaCl,  5  mM  imidazole,  and  10%  glycerol  (5  mM  imidazole  buffer).  The  cell  suspension 
was  then  passed  through  a  French  pressure  cell  at  20,000  IbW  and  centrifiiged  at  15,000 
X  g  for  15  min  at  4°C.  Cell  lysate  from  M  maripaludis  (pJAM738)  and  M  maripaludis 

9+ 

(pWLG40+  lacZ)  was  then  applied  to  separate  Cu  -Sepharose  columns  (4.8  by  0.8  cm) 
(Pharmacia)  equilibrated  with  5  mM  imidazole  buffer.  The  columns  were  washed  with  1 0 
ml  of  20  mM  Tris  buffer  at  pH  7.4  containing  375  mM  NaCl,  60  mM  imidazole,  and  10% 
glycerol.  The  Hise-MmPAN  proteins  were  eluted  with  10  ml  of  20  mM  Tris  buffer  at  pH 
7.4  containing  375  mM  NaCl,  500  mM  imidazole,  and  10%  glycerol.  The  C-Hise- 
MmPAN  (0.45  mg)  was  then  further  purified  by  application  to  a  Superose  6  size 
exclusion  colunrn  equilibrated  with  20  mM  Tris  buffer  at  pH  7.4  containing  250  mM 
NaCl,  5  mM  DTT,  and  10%  glycerol.  In  all  purification  steps,  the  purity  of  protein  was 
measured  by  reducing  SDS-PAGE. 

Partial  Purification  of  MjPAN  from  M.  jannaschii 
The  M.  jannaschii  strain  JAL-1  was  grown  to  mid-log  phase  in  MSH  medium 
under  an  80:20  H2:C02  gas  mix  at  40  Ib/in^  according  to  Boone  et  al.  (20).  M.  jannaschii 
cultures  were  chilled  to  0°C,  and  the  cells  were  harvested  aerobically  by  centrifugation  at 
5,000  X  g  for  15  min  at  4°C.  Cell  material  (0.5  g)  was  stored  at  -70°C.  Cells  were  thawed 
in  17  volumes  (wt/vol)  of  50  mM  Tris  buffer  at  pH  8.0  with  150  mM  NaCl,  10  mM 
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MgCb,  1  mM  ATP,  10  mM  p-mercaptoethanol,  10  %  glycerol,  and  0.1  %  Triton  X-100. 
The  cell  suspension  was  passed  through  a  French  pressure  cell  twice  at  1 0,000  Ib/in^  and 
then  centrifuged  at  1 5,000  x  g  for  30  min  at  4°C.  Cell  lysate  (2  mg)  was  applied  to  a 
Superose  6  HR  10/30  column  equilibrated  with  Tris-pH  8.0-buffer  with  150  mM  NaCl 
and  10  %  glycerol.  Fractions  with  NEM-inhibitable  ATPase  activity  were  applied  to  a 
MonoQ  HR  5/5  column  equilibrated  with  Tris-pH  8.0-buffer  with  100  mM  NaCl  and 
10%  glycerol.  The  proteins  were  eluted  with  a  linear  NaCl  gradient  (0.1  to  1  M  NaCl  in 
15  ml  of  Tris-pH  8.0-buffer).  In  some  experiments,  ion-exchange  chromatography 
preceded  gel  filtration.  Protein  fractions  were  analyzed  for  NEM-inhibitable  hydrolysis  of 
ATP  and  immunologically  with  MjPAN  antibody  after  denaturing  and  reducing  12% 
SDS-PAGE  (90).  Protein  was  also  analyzed  by  immunoblot  of  proteins  separated  by  non- 
denaturing  gel  electrophoresis  which  were  transferred  by  capillary  action  to  PVDF 
membranes  using  10  mM  MES  buffer  at  pH  6.0  with  20%  methanol.  Western  analysis 
was  performed  as  described  above  using  anti-MjPAN  antibodies. 

Purification  of  a  20S  Proteasome  and  2  NTPases  from  M  maripaludis: 
The  M  maripaludis  cell  material  provided  by  Brian  Waters  and  William  Whitman 
of  the  University  of  Georgia  was  prepared  as  follows.  Mid-  to  late-log  phase  M. 
maripaludis  cultures  (A60o=0.85-0.9)  grown  in  McN  media  were  harvested  anaerobically 
using  an  Alfa  Laval  Sharpies  (Model  T-IP)  centrifuge.  The  cell  paste  was  then  stored 
anaerobically  at  -70°  C  in  60  ml  Wheaton  bottles  which  had  been  pre-flushed  with  N2. 
For  purification  of  the  20S  proteasome  and  NTPase  proteins,  cells  were  thawed  in  8 
volumes  (wt/vol)  of  25  mM  Tris  buffer  at  pH  7.5  with  5  mM  DTT  and  10%  glycerol.  The 
cell  resuspension  was  passed  through  a  French  pressure  cell  twice  at  20,000  Ib/in^  and 
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then  centrifuged  at  15,000  x  g  for  20  min  at  4°C.  Cell  lysate  (20  mg  protein  per  ml)  was 
applied  to  a  Q  Sepharose  26/10  column  (2.5  by  28.5  cm)  (Pharmacia)  equilibrated  in  25 
mM  Tris  buffer  at  pH  7.5  with  5  mM  DTT  and  10%  glycerol,  and  the  proteins  were 
eluted  with  a  linear  NaCl  gradient  (0  to  1  M  NaCl  in  400  ml  buffer). 

For  purification  of  the  20S  proteasome,  fractions  which  hydrolyzed  LLVY-AMC 
were  pooled,  concentrated  by  dialysis  against  PEG8000  to  a  concentration  of  6.6  mg  per 
ml,  and  applied  to  a  Superose  6  HR  10/30  column  equilibrated  with  25  mM  Tris  buffer  at 
pH  7.5  with  150  mM  NaCl,  5  mM  DTT,  and  10%  glycerol.  Fractions  with  activity  were 
pooled,  dialyzed  in  sodium  phosphate  buffer  at  pH  7.5  containing  5  mM  DTT  and  10% 
glycerol  (dialysis  buffer),  and  applied  to  a  CHT5-1B  hydroxyapatite  column  (BioRad) 
equilibrated  with  the  dialysis  buffer.  The  20S  proteasome  was  eluted  (at  500  mM  sodium 
phosphate)  with  a  linear  gradient  from  15  mM  to  IM  sodium  phosphate-buffer  containing 
5  mM  DTT  in  40  ml.  Fractions  with  activity  were  pooled  and  dialyzed  in  25  mM  Tris 
buffer  at  pH  7.5  containing  150  mM  NaCl,  5  mM  DTT,  and  10%  glycerol,  and  applied  to 
a  MonoQ  HR  5/5  column  equilibrated  in  the  same  buffer.  The  20S  proteasome  was 
concentrated  and  eluted  in  25  mM  Tris  buffer  containing  700  mM  NaCl,  5  mM  DTT,  and 
10%  glycerol. 

For  partial  purification  of  the  two  NTPase  proteins,  Q  Sepharose  26/10 
chromotagraphy,  PEG-mediated  concentration,  and  Superose  6  gel  filtration  were  as 
described  for  purification  of  the  20S  proteasome.  Two  fractions  which  hydrolyzed  ATP 
(NTPase  #1  and  NTPase  #2)  were  identified.  These  fractions  were  loaded  onto  a  MonoQ 
5/5  column  which  was  equilibrated  in  25  mM  Tris  at  pH  7.5  containing  150  mM  NaCl,  5 
mM  DTT,  and  10%  glycerol.  Proteins  were  eluted  with  a  linear  gradient  from  150  mM  to 


1  M  NaCl  in  20  ml.  The  NTPase  #1  eluted  at  275  mM  NaCl,  and  NTPase  #2  eluted  at 
360mMNaCl. 

Production  of  Antibodies  against  MiPAN(Al-73) 
The  recombinant  MjPAN(Al-73)  protein  of  M.  jannaschii  was  applied  to  a 
reducing  SDS-12%  polyacrylamide  gel,  and  a  gel  fragment  containing  the  protein  (300 
\ig)  was  used  to  generate  polyclonal  antibodies  in  rabbits  with  Freund's  adjuvant 
according  to  supplier  (Cocalico  Biologicals,  Reamstown,  PA). 

Non-denaturing  Gel  Electrophoresis  of  PAN  Proteins 
The  PAN  complexes  were  separated  electrophoretically  under  non-denaturing 
conditions  using  a  5%  resolving  and  4%  stacking  gel  of  375  mM  Tris-HCl  at  pH  8.8 
(BioRad).  The  non-denaturing  rurming  buffer  contained  25  mM  Tris,  192  mM  glycine,  10 
mM  MgCb,  and  1  mM  ATP  at  pH  8.3.  The  Mg^^-dependent  ATPase  activity  of  proteins 
was  visualized  in  the  gel  as  previously  described  (189)  with  the  following  modifications. 
After  separation  of  the  proteins  by  electrophoresis,  the  gels  were  incubated  with  or 
without  10  mM  NEM  at  37°  or  55°C  in  50  mM  Tris  buffer  at  pH  8.0  containing  1  mM 
ATP,  10  mM  MgCl2,  100  mM  NaCl,  and  0.1  mM  lead  nitrate  until  a  white  lead 
phosphate  precipitate  formed.  The  gel  was  washed  in  deionized  water  (3x5  min)  and 
then  briefly  incubated  in  1%  (w/v)  sodium  sulfide,  which  rapidly  converted  the  white 
precipitate  to  a  brown  precipitate  of  lead  sulfide. 

Electron  Microscopy  of  Archaeal  Holo-Proteasomes 
The  M.  jannaschii  PAN,  20S  proteasome,  and  equimolar  mixtures  of  the  two 
protein  complexes  purified  from  recombinant  E.  coli  were  incubated  at  21°C  or  50°C  for 
15  min  in  25  mM  TES  buffer  at  pH  8.0  containing  100  mM  NaCl,  10  mM  MgCb,  and  1 
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mM  ATP.  Protein  samples  were  placed  on  200-mesh  grids  coated  with  Formvar  or 
carbon  film,  fixed  with  2%  cacodylate-buffered  glutaraldehyde,  briefly  stained  with  1% 
aqueous  uranyl  acetate,  and  treated  with  0.01%  bacitracin  to  improve  spreading  (81). 
Samples  were  viewed  and  photographed  using  a  Zeiss  EM- 1  OCA  transmission  electron 
microscope  operated  at  80  kV. 

Protein  Sequence  Determinations 

The  N-terminal  sequences  of  the  MjPAN  and  MjPAN(Al-73)  proteins  fi-om 
recombinant  E.  coli  were  determined  from  purified  protein  which  had  been  separated  by 
SDS-PAGE  and  electroblotted  onto  a  polyvinylidene  difluoride  (Immobilon-P) 
membrane  (Millipore).  All  amino  acid  sequencing  was  by  automated  Edman  degradation 
(51)  and  was  performed  at  the  University  of  Florida  Interdisciplinary  Center  for 
Biotechnology  Research  protein  chemistry  core  facility. 

DNA  Sequence  Determinations 

The  Sanger  dideoxy  chain-termination  method  was  used  to  sequence  all  DNA 
(207)  and  was  performed  at  the  University  of  Florida  Department  of  Microbiology  and 
Cell  Science  DNA  Sequencing  Facility. 

Protein  Sequence  Analysis 

GenBank,  EMBL,  and  SwissProt  data  bases  were  searched  at  the  National  Center 
for  Biotechnology  Information  (NCBI;  Bethesda,  MD)  using  the  BLAST  network  server 
(7).  The  CLUSTALW  program  version  1.8  (250)  or  Multialin  was  used  for  alignment  of 
protein  sequences  accessible  through  NCBI.  Secondary  structure  analysis  was  performed 
using  the  DSC  structure  prediction  program  (125). 


RESULTS  AND  DISCUSSION 
Synthesis  of  M.  jannaschii  20S  Proteasome  and  PAN  in  Recombinant  E.  coli 
Three  ORFs  which  encode  putative  components  of  an  energy-dependent 
proteasome  were  identified  from  the  genome  sequence  of  the  methanoarchaeon  M 
jannaschii  (26).  These  ORFs  were  MJ0591  (encoding  a  putative  a  subunit  of  the  20S 
proteasome  with  61%  identity  and  79%  similarity  to  M.  thermophila  PsmA),  MJ1237 
(encoding  a  putative  p  subunit  of  the  20S  proteasome  with  48%  identity  and  1\% 
similarity  to  M.  thermophila  PsmB)  and  MJl  176  (encoding  a  putative  AAA  subunit  with 
49%  identity  and  69%)  similarity  to  Rpt2p  from  H.  sapiens).  These  ORFs  were  amplified 
using  PGR  with  appropriate  primers  and  placed  into  expression  vectors  for  production  of 
the  proteins  in  recombinant  E.  coli.  When  the  E.  coli  host  strain  BL21(DE3)  was  used  for 
production  of  these  M.  jannaschii  proteins,  the  yield  of  recombinant  protein  was  less  than 
0.5%)  of  the  total  cell  protein  as  determined  by  SDS-PAGE  and  Western  blot  analysis. 
Examination  of  the  codon  composition  of  the  three  ORFs  revealed  that  up  to  12%  of  the 
codons  required  either  tRNAAUA  or  tRNAACA/AGO  for  translation,  which  are  rare  in  E.  coli 
(162).  Therefore,  a  compatible  plasmid  pSJS1240  (Table  4)  with  the  E.  coli  //eJf  and 
argU  genes  encoding  these  rare  tRNAs  was  introduced  into  the  host  (124).  In  the 
presence  of  the  plasmid  pSJS1240,  the  overall  yield  of  recombinant  M.  jannaschii  protein 
was  as  high  as  -10%)  of  total  soluble  E.  coli  proteins  indicating  that  the  rare  tRNAs  limit 
the  high  level  production  of  these  proteins  in  E.  coli. 
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M.  famaschii  20S  proteasome.  The  genes  encoding  the  a  and  (3  subunits  of  the  M. 
jannaschii  20S  proteasome  were  expressed  independently  in  recombinant  E.  coll  The  a 
and  p  proteins  assembled  into  a  700-kDa  complex  based  on  gel  filtration  with  an  overall 
structure  similar  to  other  20S  proteasomes  based  on  electron  microscopy  (287).  The 
observed  sizes  of  the  a  and  P  subunits  were  larger  than  the  calculated  molecular  masses 
deduced  from  the  M.  jannaschii  genome  sequence.  However,  immunoblot  analysis  of  M. 
jannaschii  cell  lysate  confirmed  that  both  the  a  and  p  subunits  of  the  20S  proteasome 
synthesized  in  M.  jannaschii  were  of  the  same  apparent  molecular  weight  as  the  purified 
recombinant  proteins  (Fig.  2).  Therefore,  MJ0591  and  MJ1237  were  designated  the  psmA 
and  psmB  genes  (proteasome  A  and  B)  which  is  consistent  with  the  nomenclature  of  the 
related  genes  from  M.  thermophila  (157). 

M.  jannaschii  proteasome-activating  nucleotidase  (MjPAN)  protein.  Expression 
of  ORF  MJl  176  in  recombinant  E.  coli  (pMjPAN2)  resulted  in  the  high-level  production 
of  47-,  39-,  and  30-kDa  proteins  which  were  not  apparent  in  the  absence  of  the  plasmid 
(Fig.  2).  These  results  indicated  that  production  of  MjPAN,  which  has  a  calculated 
molecular  mass  of  47-kDa,  may  have  induced  the  overproduction  of  E.  coli  proteins  such 
as  chaperones  or  other  stress-response  proteins.  Alternatively,  MjPAN  may  be 
hydrolyzed  or  the  ORF  may  encode  proteins  with  different  sizes.  The  47-  and  39-kDa 
proteins  were  stable  after  heating  the  cell  lysate  to  85°  C.  In  addition,  they  co-purified 
during  anion  exchange,  hydroxyapatite,  and  gel  filtration  chromotagraphy  as  a  ~600-kDa 
fraction  which  catalyzed  the  hydrolysis  of  ATP  at  a  rate  of  1,800  nmol  Pj  released  •  min"' 
•  mg  protein''  at  65°  C  (Fig.  3,  Lane  1).  These  two  proteins  were  further  analyzed  using 
N-terminal  Edman  degradation.  The  first  10  residues  of  the  47-kDa  and  39-kDa  proteins 
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FIG.  2.  Western  blot  analysis  of  M.jannaschii  cell  lysate  showing  that  a  and  p 
subunits  of  the  20S  proteasome  and  the  full-length  PAN  protein  are  produced  in 
M.jannaschii.  Proteins  were  separated  by  12%  reducing  SDS-PAGE  and  analyzed  by 
Western  blot.  Lane  1 :  Cell  lysate  (1-2  ^g)  of  recombinant  E.  coli  cells  producing  the 
MjPAN  proteins  of  47-,  39-,  and  30-kDa.  Lanes  2,  4,  and  6:  20  ^ig  M.  jannaschii  cell 
lysate.  Lanes  3  and  5:  2  |ig  purified  recombinant  M.jannaschii  20S  proteasome. 
Lanes  1  and  2  were  probed  with  anti-MjPAN(Al-73)  antibody.  Lanes  3  and  4  were 
probed  with  anti-Mta.  Lanes  5  and  6  were  probed  with  anti-Mtp  antibodies.  Arrows 
indicate  MjPAN,  Mja,  and  MjP  in  lanes  2,  4,  and  6  respectively.  Molecular  weight 
standards  are  indicated  on  the  left. 
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FIG.  3.  M.jannaschii  PAN  proteins  purified  from  recombinant  E.  coli. 

Protein  (2      was  separated  by  12%  reducing  SDS-PAGE  and  stained  with 
Coomassie  blue.  Lane  1:  Full-length  MjPAN  and  MjPAN(Al-73)  from 
recombinant  E.  co///pMjPAN2.  Lane  2:  His^-tagged  MjPAN  and  MjPAN(Al-73) 
from  recombinant     co///pMjPAN5.  Lane  3:  MjPAN(Al-73)  from  recombinant 
E.  co//7pMjPAN8.  Lane  4:  Full-length  MjPAN  from  recombinant  E. 
co///pMjPAN7.  Lane  5:  Molecular  weight  standards  with  masses  indicated  on 
right. 
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were  found  to  be  identical  to  residues  2  to  1 1  and  74  to  83,  respectively,  of  the  MjPAN 
protein  sequence  deduced  from  the  MJ1176  ORF  (Fig.  4).  These  results  indicate  that 
there  is  an  internal  translation  start  site  recognized  by  E.  coli  within  the  full-length  , 
mRNA  which  accounts  for  the  production  of  the  shorter  39-kDa  MjPAN(Al-73)  protein 
in  E.  coli.  In  addition,  it  is  likely  that  an  aminopeptidase  of  E.  coli  cleaved  the  N-terminal 
methionine  residue  of  the  full-length  MjPAN  protein;  whereas  the  N-terminal  methionine 
of  the  39-kDa  MjPAN  protein  was  not  removed  by  this  host  peptidase.  Similar  complex 
formation  was  observed  when  the  MjPAN  ORP  was  positioned  in-frame  to  produce  the 
full-length  MjPAN  protein  with  an  N-terminal  poly-histidine  (His6)  tag  (Table  4, 
pMJPANS).  Not  only  was  a  similarly  truncated  MjPAN  protein  produced  in  this 
recombinant  expression  system,  it  also  co-purified  with  the  full-length  Hise-tagged 
MjPAN  after  Ni^^-Sepharose  chromatography,  even  when  the  cell  lysate  was  not 
previously  exposed  to  85°C  (Fig.  3,  Lane  2).  The  ATPase  activity  of  the  Hise-tagged 
MjPAN  complex  (640  nmol  Pj  released  •  min"'  •  mg  protein"'  at  65°C)  was  over  60%  less 
than  that  of  the  untagged  MjPAN  complex  described  above.  In  addition,  the  tagged 
complex  was  less  stable  after  heating  to  85°C  or  long-term  storage  at  4°C  compared  to 
the  untagged  complex.  This  suggests  that  modification  of  the  N-terminus  of  MjPAN  by 
addition  of  a  Hise-tag  may  reduce  the  rate  of  MjPAN-mediated  ATP  hydrolysis  as  well  as 
the  stability  of  the  complex.  These  results  also  demonstrate  that  the  MjPAN(Al-73) 
protein  which  lacks  8  kDa  of  the  N-terminus  is  still  capable  of  forming  a  complex  with 
full-length  MjPAN  even  before  heating  of  the  cell  lysate  to  85°C. 

Production  and  purification  of  a  complex  containing  only  the  full-length  47-kDa 
PAN  protein.  To  purify  a  complex  containing  only  the  full-length  MjPAN  protein,  the 
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CTTTAAGAAGGAGATATACATatggtttttqaaqaatttatttcaactqaattgaagaaa 

M     VFEEFISTEL     K     K  13 

gaaaagaaagcatttactgaagaatttaaagaagaaaaggaaataaacgataattctaac 
EKKAFTEEFKEEKEINDNSN  33 

ttaaaaaatgatttacttaaagaggaactccaagaaaaggcaagaattgcagaattagaa 
LKNDLLKEELQEKARIAELE  53 

Oligo:  gagcttgaaHgHgaHaatttacagtta 
agtagaatcctaaaattagaattagagaaaaaagagcttgaaHgHgaHaatttacagtta 
SRILKLELEKKELERENLQL  73 

atg 

atgaaagaaaatgagattttaagaagagaattagatagaatgagagtccctccattgata . . . 
MKENEILRRE     LDRMRVPPL     I .  .  .  93 

FIG.  4.  Site-directed  mutagenesis  of  the  ribosomal  binding  site  located  within  the 
Mipan  gene  recognized  in  E.  colL  Capitalized  nucleotides  are  from  plasmid  pET24b, 
and  lowercase  nucleotides  are  from  the  pan  gene.  The  Shine-Dalgamo  site  is  double 
underlined.  The  nucleotides  targeted  and  the  oligonucleotide  used  for  site-directed 
mutagenesis  are  in  reverse  type  and  indicated  above  the  DNA  coding  region, 
respectively.  Amino-acid  residues  identical  to  the  N-terminal  protein  sequence 
determined  by  Edman  degradation  of  purified  47-kDa  MjPAN  species  are  underlined, 
while  those  of  the  purified  39-kDa  MjPAN  (Al-73)  species  are  underlined  and  italicized. 
Amino  acid  positions  of  the  MjPAN  protein  deduced  from  the  sequence  of  ORF  MJ1176 
are  indicated  on  the  right. 
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putative  internal  ribosomal  binding  site  of  the  ORP  (AGAGAG;  Fig.4)  which  was 
recognized  by  E.  coli  was  altered  by  site-directed  mutagenesis  at  nucleotides  A202C, 
A204T,  and  G207A  (Fig.  4).  This  removed  the  Shine-Dalgamo  consensus  sequence  while 
conserving  the  amino  acid  sequence  of  the  MjPAN  protein.  After  the  modified  gene  was 
cloned  into  a  pET24b  vector  and  expressed  in  E.  coli  (Table  4,  pMjPAN7),  the  39-kDa 
product  was  not  evident  on  either  SDS-PAGE  or  Western  blots.  A  550-kDa  complex 
containing  only  the  ftill-length  47-kDa  PAN  protein  was  then  purified  to  apparent 
homogeneity  (Fig. 3,  Lane  4). 

Production  and  purification  of  a  complex  containing  only  the  39-kDa  truncated 
MiPAN(Al-73)  protein.  In  an  attempt  to  separately  produce  the  N-terminally  truncated 
39kDa  protein,  the  ribosomal  binding  site  upstream  from  the  MJl  176  coding  sequence  on 
the  plasmid  pMjPAN7  (Table  4)  was  removed  leaving  the  fiill-length  gene  intact. 
However,  based  on  a  Western  blot,  a  small  amount  of  the  full-length  47-kDa  protein  was 
still  produced.  Therefore,  the  219-bp  of  DNA  coding  for  the  8-kDa  N-terminus  of 
MjPAN  was  removed  by  PCR  amplification  leaving  only  the  region  of  the  gene  that 
produces  the  39-kDa  species.  This  modified  gene  was  then  placed  into  pET24b  with  the 
Met^"*  ATG  start  codon  positioned  8  bp  downstream  from  the  ribosomal  binding  site  on 
the  expression  plasmid  pMjPANS  (Table  4;  Fig.  4).  The  truncated  MjPAN(Al-73)  also  ' 
purified  as  a  large  molecular  weight  complex  (~500kDa)  based  on  gel  filtration  and  non- 
denaturing  native-gel  analysis  (Fig.5),  indicating  that  the  N-terminus  is  not  required  for 
oligomerization  of  the  complex.  Thus,  the  frill-length  MjPAN  and  MjPAN(Al-73) 
proteins  were  produced  separately  and  purified  to  homogeneity,  as  determined  by 
reducing  SDS-PAGE  (Fig.  3,  Lanes  3  and  4). 
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Analysis  of  M  j'annaschii  Cell-Free  Extract  and  Partial  Purification  of  Native  MjPAN 
Cell  lysate  of  M  jannaschii  grown  on  H2  and  CO2  at  65°  C  was  analyzed  by 
immunoblot.  Western  blots  using  anti-Mta  and  anti-Mt(3  raised  against  the  subunits  of 
the  Methanosarcina  thermophila  20S  proteasome  demonstrated  that  both  the  a  and 
p  subunits  of  the  20S  proteasome  are  synthesized  in  M.  jannaschii  (Fig.  2,  Lanes  3-6).  In 
addition,  immunoblot  analysis  using  anti-MjPAN  (Al-73)  antibody  demonstrated  that 
ORFl  176  encoding  the  pan  gene  is  synthesized  in  M.  jannaschii  as  only  the  full-length 
MjPAN  protein  of  47  kDa  when  grown  under  these  conditions  (Fig.  2,  Lanes  1,2).  Partial 
purification  of  PAN  from  M.  jannaschii  cell  lysate  also  revealed  that  this  protein  is 
associated  in  a  distinct  complex  (Fig.  5,  Lane  6)  of  550-kDa  as  determined  by  gel 
filtration.  Similariy,  the  MjPAN  and  MjPAN  (Al-73)  proteins  purified  from  recombinant 
E.  coli  each  form  large  molecular-weight  complexes  (Fig.  5,  Lanes  3-5)  of  550  and  500 
kDa,  respectively,  based  on  gel  filtration.  It  remains  to  be  determined  if  the  MjPAN 
purified  from  M.  jannaschii  is  a  homooligomer  or  associates  with  additional  proteins. 

Structure  of  Recombinant  MjPAN  and  MiPAN(Al-73) 
Based  on  gel  filtration,  both  MjPAN  and  MjPAN(Al-73)  proteins  synthesized  in 
recombinant  E.  coli  formed  homooligomeric  complexes  of  550  and  500  kDa, 
respectively.  These  results  indicate  that  the  MjPAN  protein  forms  a  complex  of 
approximately  12  subunits  and  that  the  first  73  amino  acids  of  the  N-terminus  are  not 
required  for  this  association. 

Both  the  ftill-length  MjPAN  and  truncated  MjPAN(A  1  -73)  protein  complexes 
were  stable  during  purification  in  the  absence  of  ATP  but  were  readily  inactivated  when 
incubated  at  65°  C  in  glass  tubes.  The  related  ClpX  ATPase  protein  of    coli  also  has  a 
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strong  tendency  to  bind  to  glass  (288).  The  high  stability  in  the  absence  of  ATP  is  in 
contrast  to  other  related  AAA  proteins,  including  ClpA,  ClpX,  ClpQ  (HslU),  and  NEM- 
sensitive  fusion  protein  (NSF),  which  have  an  absolute  requirement  of  ATP  for  complex 
formation  and  stability  (82,  89,  118,  159).  Full-length  MjPAN  migrated  on  non- 
denaturing  polyacrylamide  gels  as  a  single-distinct  band  even  after  preincubation  at  65°  C 
(Fig.  5,  Lane  4).  This  distinct  band  catalyzed  the  hydrolysis  of  ATP  and  was  inhibited  by 
NEM,  as  determined  by  in-gel  staining  for  the  generation  of  inorganic  phosphate  (Fig.  5, 
Lanes  7  and  8).  In  contrast  to  the  related  E.  coli  HslU  AAA-ATPase  (294),  preincubation 
of  MjPAN  or  MjPAN(Al-73)  with  NEM  at  65°  C  prior  to  electrophoresis  did  not  result  in 
dissociation  of  the  complex.  Although  the  truncated  MjPAN(Al-73)  protein  also 
migrated  as  a  single  band  (Fig.  5,  Lane  3),  the  majority  of  MjPAN(Al-73)  protein  did  not 
enter  the  non-denaturing  gel,  suggesting  that  it  is  susceptible  to  aggregation  under  these 
electrophoretic  conditions.  However,  the  complex  is  stable  when  maintained  in  optimal 
buffer  and  salt  conditions  based  on  gel  filtration.  ,, 

Although  the  number  of  MjPAN  subunits  per  complex  remains  to  be  established, 
the  gel  filtration  results  suggest  that  the  complex  may  be  dodecameric.  The  conservation 
of  the  NSF-D2  oligomerization  domain  and  the  potential  12  subunit  configuration 
initially  suggested  that  the  MjPAN  protein  may  form  double-stacked  hexameric  rings 
similar  to  other  AAA  proteins  including  NSF  (89),  VAT  from  T.  acidophilum  (181),  and 
ARC  from  R.  erythropolis  (289).  Transmission  electron  micrographs  of  the  recombinant 
protein,  however,  showed  that  MjPAN  forms  particles  of  approximately  14  nm  in 
diameter  which  are  somewhat  ring-like  but  are  not  distinct  symmetrical  hexameric 
complexes  (Fig.  6).  These  results  suggest  that  MjPAN  may  not  be  stable  during 


FIG.  6.  Transmission  electron  micrograph  stereo  pairs  of  M.jannaschii  PAN 
purified  from  recombinant  E.  coli.  Several  views  of  the  comma-shaped  MjPAN 
assembly  are  seen.  Arrowhead  on  top  panel  indicates  side  view  in  which  MjPAN  most 
resembles  a  comma.  Arrowhead  in  bottom  panel  points  to  a  more  face-on  view  in 
which  the  head  of  the  comma  is  at  top.  Distinct  individual  subunits  are  seen  as  white 
globular  components  of  each  assembly.  Some  larger  clusters  most  likely  represent 
random  dimeric  and  trimeric  associations  created  when  the  sample  was  dried  for  TEM 
observation.  Bars=10  nm. 
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preparation  for  microscopy  using  uranyl  acetate  as  a  negative  stain  or  does  not  form  a 
symmetrical  complex  which  is  easily  visualized  using  these  methods. 

Biochemical  Properties  of  the  MjPAN  Complexes 
Past  studies  on  AAA  family  members,  including  the  yeast  Yhs4p/Rpt2p  (153)  and 
E.  coli  ClpA  (216, 217)  and  ClpB  (290)  proteins  suggested  that  the  N-terminus  of  PAN 
might  play  an  important  role  in  regulating  nucleotide  hydrolysis,  substrate  protein 
recognition,  and/or  other  activities.  The  MjPAN(Al-73)  complex  still  possesses  both  the 
highly  conserved  Walker  A  binding  motif  for  the  P  and  y  phosphates  of  nucleotides  (208, 
271)  and  the  Walker  B  motif  involved  in  Mg^^-binding  and  ATP  hydrolysis  (27). 
Therefore,  the  MjPAN(Al-73)  protein  provided  the  opportunity  to  examine  the  role  of  the 
highly  charged  N-terminus  on  the  biochemical  properties  of  the  MjPAN  protein.  The 
ATPase  activity  of  the  full-length  MjPAN  complex  is  highest  at  80°C  which  is  close  to 
the  growth  temperature  optimum  of  83°  C  for  this  organism  (Fig.  7A).  However,  the 
truncated  MjPAN(Al-73)  complex  was  relatively  less  heat  stable,  with  an  optimal 
ATPase  activity  at  65°C  and  a  loss  of  activity  at  higher  temperatures  (Fig.  7A).  While 
this  study  did  not  test  the  temperature  optimum  of  the  complex  formed  of  both  MjPAN 
and  MjPAN(Al-73),  the  ATPase  activity  of  the  complex  of  His6-MjPAN  and 
MjPAN(Al-73)  was  recently  shown  to  be  optimal  at  73°C  (303).  Secondary  structural 
analysis  suggests  that  an  a-helix  in  the  N-terminus  of  MjPAN,  which  contains  a  putative 
coiled-coil  region  (196,  275),  is  absent  in  the  truncated  protein.  Therefore,  the  a  helix 
may  play  an  important  role  in  stabilizing  MjPAN  at  extreme  temperatures.  Based  on 
analogy  to  other  proteins  that  are  stable  at  extreme  temperature  and/or  salt  conditions,  it 
is  possible  that  ion-pairs,  hydrophobic  clusters,  or  other  structural  elements  are  required 
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FIG.  7.  Effect  of  pH,  temperature,  and  [NaCl]  on  the  ATPase  activity  of  the  MjPAN(§) 
and  truncated  MjPAN(Al-73)  (0  )  proteins.  ATPase  activity  was  measured  as  the  release 
of  inorganic  phosphate  at  the  temperature  (A),  pH  (B),  and  salt  concentrations  (C)  indicated. 
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for  the  thermostability  of  MjPAN  and  that  these  are  disrupted  by  removal  of  the  N- 
terminal  a-helix.  , 

Both  the  full-length  and  truncated  MjPAN  proteins  have  optimal  ATPase  activity 
at  pH  7-8.  Both  were  also  more  active  at  higher  pH  (8-10)  than  at  lower  pH  (below  6) 
(Fig.  7B).  The  ATPase  activities  of  the  full-length  and  truncated  MjPAN  proteins  were 
both  stimulated  by  the  addition  of  either  KCl  or  NaCl  (Fig.  7C),  and  the  proteins  lost 
activity  when  preincubated  in  a  salt-free  buffer  prior  to  the  assay.  The  ATPase  activity  of 
full-length  MjPAN  was  stimulated  up  to  two-fold  by  NaCl  or  KCl  concentrations  to  2.5 
M,  while  activity  was  reduced  at  higher  salt  concentrations.  In  contrast,  the  MjPAN(Al- 
73)  complex  had  lower  activity  in  NaCl  or  KCl  concentrations  above  IM  (Fig.  7C).  In 
summary,  the  full-length  MjPAN  appeared  to  be  more  stable  at  both  high  salt  and  high 
temperature  conditions.  These  results  indicate  that  the  full-length  MjPAN  protein,  which 
was  also  detected  in  M  jannaschii  using  Western  blot  (Fig.  2),  is  likely  the  native  form 
synthesized  by  this  organism. 

The  ATPase  activities  of  both  PAN  proteins  were  inhibited  by  the  Mg^^-chelator 
EDTA  and  also  by  the  sulfhydryl-blocking  agents  NEM  and  ;?ara-chloromercuriphenyl- 
sulfonic  acid  (PCMS)  (Table  5).  The  MjPAN  activities  were  not  significantly  affected  by 
NaNa  (Table  5)  which  inhibits  H"^-transporting,  membrane-bound  ATPase  proteins.  These 
results  are  consistent  with  results  obtained  from  studies  of  other  proteins  of  the  AAA 
family  which  catalyze  Mg^"'-dependent  ATP  hydrolysis  and  are  inhibited  by  NEM  (11, 
57,  98,  153,  241,  289).  It  has  been  proposed  that  a  highly  conserved  cysteine  residue  in 
the  C-terminus  of  these  AAA-ATPases  {e.g.,  Cys384  of  MjPAN)  is  most  likely 
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TABLE  5.  Effect  of  various  compounds  on  the  ATPase  activity  of  the  Methanococcus 
jannaschii  PAN  and  truncated  PAN  (A  1-73)  proteins. 


Effector 

Cone. 
(mM) 

Relative  Activity  ^ 

MjPAN 

MjPAN  (Al-73) 

None 

100  ±  6 

100  ±5 

EDTA 

10 

8  ±  3 

UD*" 

SDS 

0.1 

63  ±5 

nd'^ 

0.2 

3  ±4 

nd 

NEM 

10 

UD 

15  ±4 

5 

23  ±  1 

nd 

1 

81  ±8 

nd 

PCMS 

0.05 

UD 

UD 

NaNa 

5 

90  ±2 

74  ±5 

^ATPase  activity  is  expressed  as  a  percentage  of  the  control  reaction  with  no  added 
effector.  The  enzyme  (2  ^ig  per  ml)  was  preincubated  with  or  without  the  effector  in 
lOOmM  NaCl,  lOmM  MgCb,  25mM  TES,  pH8.0  at  2\°C  for  15  min  before  addition  of 
ImM  ATP  and  incubation  at  80°C  (MjPAN)  or  65°C  (MjPAN  Al-73). 
''UD,  undetectable, 
'^nd,  not  determined. 
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modified  by  PCMS  and  NEM  (294).  However,  unlike  other  related  AAA  proteins  such  as 
Cdc48  ofS.  cerevisiae  (57),  preincubation  of  MjPAN  with  ATP  did  not  protect  the 
protein  from  NEM  inhibition  (Fig.  5,  Lanes  7  and  8). 

The  purified  MjPAN  protein  catalyzed  a  very  high  rate  of  ATP  hydrolysis  when 
compared  to  other  AAA  family  members  (Table  6)  with  a  Fmax  of  3,500  nmol  Pj  released 
•  min"'  •  mg  protein"'  (Table  7).  The  only  comparable  ATPases  are  E.  coli  CIpA  (104)  and 

the  yeast  Cdc48  complex  (57)  with  Fmax  values  of  -5,000  and  1,560,  respectively  (Table 

6)  .  Several  complexes  which  contain  AAA- ATPases,  such  as  the  19S  complex  of  the  26S 
proteasome  (98),  have  over  100-fold  lower  Fmax  values  for  ATP  hydrolysis  than  MjPAN  . 
(Table  6).  The  affinity  of  MjPAN  for  ATP  {Km  =  497  |^M)  is  comparable  to  some  other 
purified  AAA  proteins  including  yeast  Cdc48  (57),  mammalian  NSF  (165),  and  E.  coli 
ClpX  (278)  (Table  6).  However,  MjPAN  had  a  much  lower  affinity  for  ATP  than  some 
other  AAA  proteins  including  E.  coli  proteins  ClpB  (290),  Lon  (280),  and  FtsH  (254). 
Also,  the  Rpt2p  subunit  as  well  as  the  19S  cap  of  the  26S  proteasome  have  a  much  higher 
affinity  for  ATP  than  does  MjPAN  (98, 153).  In  general,  there  appears  to  be  a  wide  range 
of  both  ATPase  activities  as  well  as  affinities  for  ATP  (at  least  in  vitro),  even  among  very 
closely-related  AAA  homologs. 

Surprisingly,  the  V^^  values  for  CTP  hydrolysis  by  MjPAN  and  MjPAN(Al-73) 
were  1 .7-  and  2-fold  higher,  respectively,  than  the  Fmax  values  for  ATP  hydrolysis  (Table 

7)  .  This  is  consistent  with  the  partially  purified  MjPAN  complex  from  M.  jannaschii  cell 
lysate  which  had  a  2:1  ratio  of  CTP-  to  ATP-hydrolyzing  activity  and  was  also  inhibited 
by  NEM.  Full-length  MjPAN  had  a  60%  higher  affinity  for  CTP  than  for  ATP,  while  the 
truncated  MjPAN(Al-73)  complex  showed  little  change  in  affinity  between  the  two 
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TABLE  6.  Comparison  of  Vmax  and      values  for  AAA-ATPases. 


Protein  Name 

Organism 

Vmdx 
(nmol  min  mg  ) 

(^M  ATP) 

Reference 

MjPAN 

M  jannaschii 

3500 

497 

this  study 

MjPAN  (Al-73) 

M.  jannaschii 

4800 

307 

this  study 

Cdc48 

S.  cerevisiae 

1560 

550 

(57) 

p97 

X.  laevis 

720 

nd 

(189) 

VAT 

V  r\  1 

1.  uciuupniiurn 

QO 

nu 

(151) 

ADC 

R.  stythropolis 

zoo 

zUU 

(289) 

IN  or 

riamsicr  ovary 

0  / 

OjU 

(241) 

(AAA-D2) 

6 

110 

(165) 

(AAA-Dl) 

257 

43300 

(165) 

Vps4p 

S.  cerevisiae 

448 

nd 

(11) 

Vhs4p  (Rpt2p) 

S.  cerevisiae 

7 

5 

(153) 

Vhs4pA  (Rpt2pA) 

S.  cerevisiae 

20 

5 

(153) 

ST  TGI  ^^Rntfin^ 

x\.ai  liver 

n 

1 

IK. 

(155) 

^Uo  piULCaaUIIlC 

PciKKit 
IvdDUU 

1  Q 

15 

(98) 

reticulocytes 

19S  (RP) 

Rabbit 

34 

30 

(98) 

reticulocytes 

ClpA 

E.  coli 

5000 

210 

(104) 

ClpB 

E.  coli 

50-100 

1100 

(290) 

ClpX 

E.  coli 

500 

500 

(278) 

FtsH 

E.  coli 

230-459 

80-83 

(254,  111) 

HslU  (CIpY) 

E.  coli 

57  (-HslV); 

280 

(295) 

213  (+HslV) 

nd,  not  determined 
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TABLE  7.  Kinetic  parameters  of  the  Methanococcus  jannaschii  PAN 
and  truncated  PAN  (Al-73)  proteins. 


Enzyme 

Substrate 

Am 

V 

'  max 

MjPAN 

ATP 

497 

3500 

CTP 

307 

5800 

MjPAN  (Al-73) 

ATP 

477 

4800 

CTP 

493 

9500 

VM  of  ATP  or  CTP. 

''nmol  Pi  released  per  min  per  mg  protein. 

'^For  determination  of  the  kinetic  parameters  of  MjPAN  and  MjPAN(Al-73),  enzyme 
(4  i^g  per  ml)  was  incubated  with  ImM  ATP  or  CTP  in  25niM  TES  buffer  at  pH8.0  with 
lOOmM  NaCl  and  lOmM  MgCb  at  80°C  for  PAN  or  65°C  for  MjPAN(Al-73).  Initial 
velocity  was  determined  from  0  to  6  min  by  measuring  the  release  of  Pj  in  triplicate. 
Initial  velocity  was  plotted  as  a  function  of  substrate  concentration  using  Lineweaver- 
Burk,  Michaelis-Menton,  and  Eadie-Hofstee  plots  (39). 
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nucleotides  (Table  7).  Of  the  nucleotides  tested,  the  full-length  MjPAN  protein 
hydrolyzed  only  ATP  and  CTP  and  did  not  hydrolyze  UTP,  TTP,  GTP,  or  ITP  (Table  8). 
In  contrast,  MjPAN(Al-73)  hydrolyzed  ITP,  GTP,  TTP,  and  UTP  at  92,  73,  37  and  19% 
the  rate  of  ATP  hydrolysis,  respectively  (Table  8).  Neither  MjPAN  or  MjPAN(Al-73) 
appeared  to  hydrolyze  nucleotide  diphosphates  ADP  or  CDP  (Table  8).  The  ATPase 
activity  of  full-length  MjPAN  was  strongly  inhibited  by  equimolar  concentrations  of 
ADP  to  5%  of  the  original  activity  but  was  only  slightly  inhibited  in  the  presence  of  an 
equimolar  concentration  of  CDP  (Table  9).  In  contrast,  the  CTPase  activity  of  full-length 
MjPAN  was  almost  completely  inhibited  by  ADP  and  also  strongly  inhibited  by  CDP  to 
16%  of  the  original  activity.  For  the  truncated  MjPAN(Al-73)  complex,  the  ATPase 
activity  was  also  strongly  inhibited  by  equimolar  ratios  of  ADP  to  1 0%  of  the  original 
activity,  while  CDP  had  no  detectable  effect  on  ATPase  activity.  The  CTPase  activity  of 
MjPAN(Al-73)  was  not  as  strongly  inhibited  by  either  ADP  or  CDP  as  was  full-length 
MjPAN  (Table  9).  Inhibition  by  ADP  has  also  been  observed  for  other  AAA-ATPases 
including  ClpB  (290)  and  HslU  (201).  Although  the  high  rate  of  CTP  hydrolysis  was 
somewhat  surprising,  the  results  are  consistent  with  previous  studies  which  have  shown 
that  CTP  can  substitute  for  ATP  hydrolysis  in  assembly  of  the  eukaryotic  20S  proteasome 
with  the  RP  complex  (60)  containing  the  six  AAA  Rpt  proteins  of  the  Base  domain  (72). 
In  addition,  another  recent  study  of  the  MjPAN  complex  reported  the  hydrolysis  of  GTP 
and  UTP  at  58%  and  CTP  at  205%  the  rate  of  ATP  hydrolysis  (303).  Other  AAA- 
ATPases  have  also  been  observed  to  hydrolyze  several  types  of  nucleotides,  including  the 
19S  cap  of  the  mammalian  26S  proteasome  and  E.  coli  Lon  which  can  hydrolyze  ATP, 
CTP,  UTP,  and  GTP  (98, 279).  The  ARC  complex  from  R.  erythropolis  (289)  can 
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TABLE  8.  Nucleotide-hydrolyzing  activities  of  the  Methanococcus  jamaschii 
PAN  and  truncated  PAN(Al-73)  proteins. 


Substrate 

Relative  Activity^ 

MjPAN 

MjPAN  (Al -73) 

ATP 

100±7 

100  ±4 

CTP 

189±5 

209  ±  8 

UTP 

UD 

19  ±6 

GTP 

UD 

73  ±3 

TTP 

UD 

37  ±0.3 

ITP 

UD 

92  ±  1 

ADP 

UD 

UD 

CDP 

UD 

UD 

AMP-PNP 

UD 

UD 

^Activity  is  expressed  as  a  percentage  of  inorganic  phosphate  released  compared  to  the 
control  reaction  using  the  indicated  substrate.  Enzyme  (2  |ag  per  ml)  was  incubated  with 
ImM  nucleotide  in  25mM  TES  buffer  at  pH  8.0  with  lOOmM  NaCl  and  lOmM  MgCl2  at 
80°C  for  MjPAN  or  65°C  for  MjPAN(Al-73). 
UD,  undetectable 
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TABLE  9.  The  effect  of  dinucleotides  on  nucleotide  triphosphate  hydrolyzing 
activities  of  recombinant  M.  jannaschii  PAN  and  PAN  (Al-73)  proteins 


Relative  Activity^ 
MjPAN  MjPAN  (Al-73) 


CTP  Hydrolysis 

Control 

100  ±  0.4 

100  ±2 

ADP 

ud'' 

30  ±  1 

CDP 

16  ±0.6 

23  ±2 

ATP  Hydrolysis 

Control 

100  ±7 

100  ±4 

ADP 

5  ±5 

11  ±3 

CDP 

87  ±2 

100  ±0.8 

^Relative  activity  is  expressed  as  the  percentage  of  inorganic  phosphate  released 
compared  to  control  reaction  without  added  nucleotide  diphosphate.  The  enzyme  (2  |xg 
per  ml)  was  preincubated  with  or  without  ImM  CDP  or  ADP  for  15  min  in  25  mM  TES 
at  pH  8.0  with  100  mM  NaCl  and  10  mM  MgCb  at  2rC  before  addition  of  ImM  ATP  or 
CTP  and  incubation  at  80°C  for  MjPAN  or  65°C  for  MjPAN  (Al-73). 
''UD,  undetectable. 


91 

hydrolyze  both  CTP  and  ATP  at  fairly  high  rates.  The  FtsH  protease  from  E.  coli 
hydrolyzes  CTP,  GTP,  and  UTP  at  107, 46,  and  27%  the  rate  of  ATP  hydrolysis, 
respectively;  however,  only  ATP  or  CTP  supported  degradation  of  the  substrate  a''^ 
protein  (111). 

P AN-Stimulated  ATP-dependent  Proteolysis  of  Protein  Substrates 
by  the  20S  Proteasome 

Both  full-length  MjPAN  and  MjPAN(Al-73)  complexes  stimulated  the 

degradation  of  P-casein  by  the  M.  jannaschii  20S  proteasome  by  2.2  ±  0.19-fold  and  1.4 

±  0.04-fold,  respectively.  This  stimulation  was  completely  dependent  on  the  presence  of 

either  ATP  or  CTP  (data  not  shown).  No  significant  energy-dependent  hydrolysis  of  P- 

casein  was  observed  in  the  presence  of  ADP,  and  stimulation  was  similar  at  molar  ratios 

ranging  from  0.5:1  to  4:1  of  PAN:20S  proteasome  complex.  Furthermore,  the  MjPAN 

and  MjPAN(Al-73)  complexes  stimulated  by  2-  to  4-fold  the  degradation  of  P-casein  by 

the  M  thermophila  20S  proteasome  in  the  presence  of  ATP  but  not  ADP  (Fig.  8).  Under 

these  assay  conditions,  the  rate  of  ATP  hydrolysis  by  MjPAN  was  not  influenced  by  the 

addition  of  the  synthetic  peptide  Suc-LLVY-AMC  or  the  20S  proteasome  alone  but  was 

enhanced  by  the  addition  of  P-casein  by  1 .22  ±  0.07-fold  or  P-casein  plus  the  20S 

proteasome  by  1.38  ±  0.08-fold  (data  not  shown).  In  addition,  CTP  could  substitute  for 

ATP  (at  -80%  the  rate)  in  stimulation  of  protein  hydrolysis  (Fig.  8).  This  is  consistent 

with  what  has  been  observed  for  other  energy-dependent  proteases  including  the 

eukaryotic  26S  proteasome  (60),  as  well  as  E.  coli  proteins  Lon  (279)  and  FtsH  (254). 

Somewhat  surprisingly,  5'-adenylyl  P,Y-imidophosphate  (AMP-PNP),  which  is  not 

hydrolyzed  by  the  MjPAN  protein,  also  supported  MjPAN  activation  of  P-casein 
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Time  (min) 


FIG.  8.  Nucleotide-dependent  MjPAN  and  MjPAN  (Al-73)  stimulation  of  |3-casein 
degradation  by  the  Methanosarcina  thermophila  20S  proteasome.  The  20S 

proteasome  was  incubated  at  55°C  with  or  without  addition  of  MjPAN  proteins  and/or 
nucleotides,  as  indicated.  The  buffer  used  was  25  mM  TES  (pH8)  with  lOmM  MgCl2, 
ImM  nucleotide,  and  2  mg  of  P-casein  per  ml.  Newly  released  a-amino  groups  generated 
by  protein  hydrolysis  were  measured  using  fluorescamine.  Symbols:  • ,  MjPAN  and  20S 
proteasome  with  ImM  ATP;  ■ ,  MjPAN  and  20S  proteasome  with  ImM  CTP; 
^,  MjPAN  (Al-73)  and  20S  proteasome  with  ATP;  □  ,  20S  proteasome  alone  with  ImM 
ATP;  O  ,  MjPAN  and  20S  proteasome  with  ADP. 
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Time  (min) 

FIG.  9.  ATP-  and  AMP-PNP-mediated  MjPAN  stimulation  of  p-casein  degradation 
by  the  Methanosarcina  thermophila  20S  proteasome.  The  20S  proteasome  was 
incubated  at  55°  C  with  or  without  addition  of  MjPAN  protein  and/or  nucleotides,  as 
indicated.  The  buffer  used  was  25mM  TES  (pH8)  with  lOmM  MgCl2,  ImM  nucleotide, 
and  2  mg  of  (3-casein  per  ml.  Newly  released  a-amino  groups  generated  by  protein 
hydrolysis  were  measured  using  fluorescamine.  Symbols:  • ,  MjPAN  and  20S 
proteasome  with  ImM  ATP;  ■ ,  MjPAN  and  20S  proteasome  with  AMP-PNP;  0 ,  20S 
proteasome  alone  with  ATP;      MjPAN  and  20S  proteasome  with  ADP. 
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degradation  (Fig.  9).  However,  the  mechanism  of  degradation  appeared  to  be 
qualitatively  different.  When  ATP  or  CTP  was  hydrolyzed  there  was  a  rapid  increase  in 
the  generation  of  free  a-amino  groups  which  then  leveled  off.  This  plateau  in  activity 
could  be  due  to  an  increase  in  the  ADP  or  CDP  concentrations  which  then  inhibit  the 
nucleotidase  activity  of  the  MjPAN  complex.  In  contrast,  there  was  a  significant  lag 
phase  observed  when  AMP-PNP  was  present.  In  addition,  there  was  no  indication  that  the 
generation  of  new  a-amino  groups  was  reaching  a  plateau  (Fig.  9).  Although  these  results 
contrast  with  those  of  another  recent  study  which  reported  that  the  complex  of  Hise- 
PAN/PAN(Al-73)  in  the  presence  of  AMP-PNP  does  not  activate  the  breakdown  of 
proteins  by  the  T.  acidophilum  20S  proteasome  (303),  the  methods  used  to  monitor 
protein  hydrolysis  were  different.  In  the  present  study,  hydrolysis  was  measured  by  the 
production  of  new  a-amino  groups  using  fluorescamine,  whereas  the  previous  study 
measured  conversion  of  P-['''C]casein  into  acid-soluble  products  (303).  Together,  these 
results  suggest  that  the  presence  of  a  nonhydrolyzable  ATP  analogue,  such  as  AMP-PNP, 
may  support  only  a  single  cleavage  or  a  small  number  of  cleavages  of  P-casein  by  the 
20S  proteasome.  This  would  result  in  the  generation  of  large,  acid-precipitable  fragments 
with  new  amino  groups.  Thus,  it  is  possible  that  ATP  hydrolysis  is  required  for  the 
MjPAN-stimulated  processive  degradation  of  proteins  to  acid-soluble  products  by  the 
20S  proteasome.  However,  ATP  (or  a  non-hydrolyzable  analog  such  as  AMP-PNP) 
binding  to  MjPAN  supports  limited  and  non-processive  hydrolysis  of  proteins  by  the  20S 
proteasome.  Similar  results  were  seen  with  the  Lon  protease  of  E.  coli  (52).  Edmunds  and 
Goldberg  demonstrated  that  non-hydrolyzable  nucleotide  analogs  such  as  AMP-PNP 
could  support  cleavage  of  casein  by  Lon  into  several  large  fragments,  but  processive 
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degradation  into  acid-soluble  small  fragments  was  only  observed  in  the  presence  of  ATP 
(52).  The  HslUV  protease  can  also  support  proteolysis  of  protein  and  peptidase  substrates 
in  the  presence  of  AMP-PNP,  but  there  is  an  initial  lag  time  in  proteolysis  that  is  not  seen 
with  ATP  (103).  Interestingly,  HslU  has  two  C-terminal  cysteine  residues,  and  mutation 
of  one  (C261 V)  blocks  the  NEM-mediated  monomerization  of  the  complex,  but  has  no 
effect  on  ATPase  activity  (294).  Mutation  of  the  other  C-terminal  cysteine  residue 
(C287V)  abolishes  the  ATPase  activity  of  the  complex;  however,  the  mutant  HslU  can 
still  form  a  complex  with  HslV  to  support  protein  hydrolysis  (294).  These  results  indicate 
that  ATP  binding,  but  not  necessarily  hydrolysis,  is  required  for  oligomerization  of  HslU 
and  for  interaction  of  HslU  with  HslV.  Finally,  the  nonhydrolyzable  analog  ATP-yS  can 
support  the  chaperone  activity  of  ClpX  in  the  protection  of  XO  from  aggregation,  but 
ATP  hydrolysis  is  required  for  the  disassembly  of  A,0  aggregates  (278). 

The  N-terminal  coiled-coil  region  of  the  MjPAN  protein.  The  effect  of  an  N- 
terminal  truncation  has  been  studied  in  other  AAA  proteins,  including  ClpA  (217)  and 
ClpB  (290)  from  E.  coli  and  Vhs4p  (Rpt2p)  from  5'.  cerevisiae  (153).  When  the  ClpB 
protein  from  E.  coli  was  expressed  from  a  plasmid,  it  purified  as  a  complex  composed  of 
both  93-  and  79-kDa  proteins  (236,  290).  When  the  open  reading  frame  is  modified  to 
produce  only  the  79-kDa  protein  from  the  internal-translation  start  site,  a  tetrameric 
complex  still  formed  (183).  The  truncated  ClpB  complex  had  higher  ATPase  activity  than 
the  friU-length  complex,  but  the  ATPase  activity  was  not  stimulated  by  protein  substrates 
indicating  that  the  N-terminal  domain  may  contain  a  region  essential  for  substrate  protein 
binding  (183).  Furthermore,  when  increasing  concentrations  of  the  truncated  ClpB  were 
mixed  with  the  fiiU-length  ClpB,  the  truncated  ClpB  was  found  to  inhibit  the  protein- 
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activated  (but  not  the  basal)  ATPase  activity  of  full-length  ClpB  (183).  Interestingly,  an 
N-terminal  deletion  (Al-50)  of  the  yeast  Yhs4p  also  results  in  a  2.8-fold  increase  in 
ATPase  activity  compared  to  the  fiill-length  protein  (Table  6);  however,  the  ability  of  this 
modified  protein  to  support  ATP-dependent  protein  degradation  has  not  been  determined 
(153).  Finally,  based  on  a  study  where  the  human  Rpt2p/S4  of  the  RP  was  synthesized 
without  the  N-terminal  85-amino-acid  coiled-coil  region,  it  was  determined  that  this 
region  of  the  protein  is  necessary  for  interaction  with  the  Rptlp/S7  subunit  (196).  These 
studies  demonstrate  that  the  shorter  proteins  derived  from  the  internal  translation  sites  of 
these  ORFs  act  to  inhibit  the  activity  of  the  full-length  protein.  Similar  results  were 
obtained  in  this  study.  Even  though  the  MjPAN(Al-73)  complex  had  higher  ATPase 
activity  than  the  full-length  MjPAN  complex,  it  supported  the  ATP-dependent  hydrolysis 
of  casein  by  the  20S  proteasome  to  only  -60%  the  rate  of  MjPAN  (Fig.  8). 

Association  of  MjPAN  with  Archaeal  20S  Proteasomes 
The  purified  M.  jannaschii  20S  proteasome  and  MjPAN  proteins  were  mixed  at 
equimolar  ratios  and  then  incubated  in  the  presence  of  ATP  and  Mg^^  at  50°  C.  Electron 
micrographs  of  these  mixtures  reveal  particles  which  resemble  the  20S  core  of  12  nm 
width  by  17  nm  length  capped  either  singly  or  doubly  on  each  end  by  MjPAN  protein 
complexes  (Fig.  10).  The  singly  capped  proteasome-associated  MjPAN  complexes  were 
-24  nm  long  and  12  to  14  nm  wide.  These  micrographs  are  consistent  with  the  ability 
ofthe  MjPAN  proteins  to  stimulate  proteolysis  by  20S  proteasomes  and  suggest  that 
archaeal  20S  proteasomes  may  interact  with  MjPAN  to  form  a  complex  which  resembles 
the  26S  proteasome  subcomplex  recently  described  in  S.  cerevisiae  (72).  The  efficiency 


FIG.  10.  Transmission  electron  micrographs  of  reconstituted  20S  proteasome 
and  MjPAN  proteins.  A.  Untilted  view  of  particle  generated  during  reconstitution  of 
the  MjPAN(Al-73)  and  20S  proteasome.  Arrowhead  indicates  cylindrical  assembly 
with  apparent  comma-shaped  assembles  of  putative  ]VIjPAN(Al-73)  dodecamers  at 
both  ends.  B.  Stereo  view  of  single  20S  cylinder  with  putative  comma-shaped 
MjPAN  assembly  at  left  end.  Bars,  20  nm. 
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of  reconstitution  of  the  MjPAN:20S  proteasome,  however,  was  very  low  (less  than  0.5%), 
indicating  that  either  1)  the  interaction  between  the  proteins  may  be  transitory,  2)  the 
complexes  are  not  stable  during  preparation  for  electron  microscopy,  or  3)  additional 
factors  may  be  necessary  for  stable  complex  formation. 

Proteasomal  Components  in  M.  maripaludis 
Even  though  the  entire  genome  of  M  jannaschii  has  been  sequenced,  and  several 
recombinant  proteins  have  been  characterized,  little  is  known  of  the  physiology  or 
genetics  of  this  hyperthermophilic  archaeon.  While  in  vitro  studies  using  recombinant 
proteins  have  been  useful,  a  better  understanding  of  the  physiological  functions  and 
substrates  of  the  energy-dependent  proteolytic  system  in  methanogens  and  other  archaea 

™  will  require  the  ability  to  perform  in  vivo  studies.  Both  genetic  tools  and  high-yield 
culture  techniques  have  been  developed  for  use  in  the  mesophilic  methanoarchaeon  M 
maripaludis.  Over-expression  of  recombinant  proteins  which  require  the  reducing 
environment  provided  by  the  M  maripaludis  cytosol,  and  thus  are  not  expressed  in  an 
active  form  in  E.  coli,  has  also  been  demonstrated  (62).  These  techniques  should  aid  in 
purification  and  characterization  of  recombinant  proteins  from  methanogens. 

'  :  .  Isolation  of  the  Mmpan  Gene  from  a  M  maripaludis  Genomic  Library  Southem 

analysis  was  performed  using  M.  maripaludis  genomic  DNA  which  had  been  digested 
v^th  several  restriction  enzymes,  including  EcoKV,  Xbal,  and  Sma\.  When  the  Southem 
blot  was  probed  using  the  M.  jannaschii  pan  (A  1-73)  gene  fragment  {M]pan  probe),  a  9- 
kbp  Xbal  and  a  2.5-kbp  EcoKV  positive  fragment  were  identified.  Digestion  of  genomic 
DNA  with  Sspl  (with  recognition  sequence  AAT^ATT)  yielded  numerous  small  positive 
fragments,  due  to  the  high  A-T  content  (67%)  of  the  M.  maripaludis  genome.  A  random 
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M.  maripaludis  genomic  X  library  was  then  screened  using  the  M]pan  probe  to  isolate  the 
related  gene  from  M.  maripaludis.  Approximately  1600  total  plaques  were  screened,  and 
14  positive  clones  were  identified.  Plasmid  subclones  from  all  14  positive  clones  were 
then  obtained  by  automatic  Cre-lox-mediated  subcloning  using  the  host  strain  BM25.8 
(Table  3).  This  procedure  yielded  >500  colonies  for  all  14  clones  after  incubation  at  37° 
C  overnight,  and  three  clones  (#3,  #8,  and  #1 1)  were  chosen  for  fiirther  study.  Clone  #1 1 
contained  a  plasmid  (pJAM  721,  Table  4)  with  a  ~15-kbp  insert,  which  yielded  a  9-kbp 
Xbal  and  a  2.5-kbp  EcdRM  positive  fragment  during  Southern  analysis.  The  407  amino- 
acid  long  deduced  sequence  of  the  isolated  1224-bp  ORF  (Fig.  1 1)  was  70%  identical  to 
MjPAN.  Interestingly,  when  the  ORF  was  examined,  two  possible  translation  start  sites 
were  detected,  both  with  putative  Shine-Dalgamo  consensus  sequences  (Fig.  11). 
However,  there  was  an  in-frame  stop  codon  39-bp  downstream  from  the  first  start  site, 
suggesting  that  translation  from  this  site  would  produce  a  16  amino-acid  long  peptide.  To 
confirm  the  accuracy  of  the  DNA  sequence  and  to  ensure  that  the  correct  start  site  was 
chosen  for  cloning  ofMmpan,  the  upstream  region  5'  to  the  start  site  was  amplified 
directly  from  M  maripaludis  genomic  DNA  using  the  high-fidelity  Vent  DNA 
polymerase.  The  PCR  fragment  was  cloned  into  a  pCR-TOPO  vector  to  create  plasmid 
pJAM745  (Table  4).  Sequencing  of  this  plasmid  in  both  directions  verified  that  the 
upstream  region  of  the  gene  isolated  directly  from  genomic  DNA  was  identical  to  that 
obtained  from  the  X  library  clone.  In  addition,  when  the  Mn\pan  sequence  from  this  study 
was  compared  to  Mmpan  (ORF558)  from  the  M.  maripaludis  LL  genomic  sequence  (J. 
Leigh,  personal  communication),  the  DNA  sequence  was  96%  identical.  Furthermore,  the 
nucleotide  substitutions  were  all 


100 


5' -cttgtttttgttttgtttcatttttgaatactgcctttggttttctagaatatcctt 
taggcgacattccaaattttgcgcaacgctcgcaaacattcatttctacgccttctactc 

Oligo  #1:   5' -cctcttttccgcagagttcgcattg-3' 

ttgttttaatgatattcgtaacctcttttccgcagagttcgcattgcatacctacacctt 

aatatactaaatttaaagatatttttgtccaagttatcctgattacatagcattcaaatg 
gtaaatgcttcattagagaatatatcttcaattatttaatattatctatttttgctattt 

Box  A  Box  B 


atatattatatcc 


ataaatttatatagtccaaataaattataaacatatacccttattga 


ataataqttgtaaaaatgttttcgataqgtgtaaaaactqttttgcqagaaaqaaaatca 

atgagaaaqcqctatataqaactatttttatcatatttattacqqaqqtaaatcatgagt 
MRKRYIELFLSYLLRR*IMS 

tacccagatgattattccacagatatagaaaaaaataaaatggatttaaaagaatttaaa 
YPDDYSTDIEKNKMDLKEFK  22 

gaaaaaactcagatcgcagagttagaaagcaaagtattgagattggaattaaaaaataaa 
EKTQIAELESKVLRLELKNK  42 

gatattaatagagagaatgtacaaattaaaaaagaaaatgaaatattaaagcgagaactt 
DINRENVQIKKENEILKREL  62 

gataaattaagaattccaccactcatactcggtacaatacttgataaagtaaatgaaaga 
DKLRIPPLILGTILDKVNER  82 

aaggctgttgttaaaagttctacagggccaaacttccttgtaaacctttcacaattcgta 
KAVVKSSTGPNFLVNLSQFV  102 

gatcctgaggatattgttccaggagcaagagtttgtttaaaccagcagactttagcgatt 
DPEDIVPGARVCLNQQTLAI  122 

Oligo  #2:3' -gtggctcgataccgttacctttag-5' 

gttgaagtattgcctaaagaaaaagactaccgagctatggcaatggaaatcgaggaaaaa 
VEVLPKEKDYRAMAMEIEEK  142 

ccagacatttcctttgaagatatcggtggtttgaacaaccagattagagatattaaagaa 
PDISFEDIGGLNNQIRDIKE  162 

gtagtagaactcccacttaaaaacccagaattatttgaaaaagtgggtatcgttccacca 
VVELPLKNPELFEKVGIVPP  182 

aaaggagtattactctatggtccgccaggaactggaaaaacgctccttgcaaaagctgtt 
KGVLLYGPPGTGKTLLAKAV  202 

gcatacgaaaccaatgcatcatttgttagagttgtcggctcagaactcgtcaaaaaattc 
AYETNASFVRVVGSELVKKF  222 

FIG.  11.  Deduced  amino  acid  sequence  of  the  Mmpan  gene.  Putative  ribosomal  binding  sites 
are  double  underlined,  putative  start  codons  are  in  bold,  and  putative  stop  codons  are  indicated  by 
an  asterisk.  Amino  acid  positions  of  the  MmPAN  protein  deduced  from  the  DNA  sequence  are 
indicated  on  the  right.  Oligonucleotides  used  to  generate  plasmid  pJAM  745  are  indicated  above 
the  nucleotide  sequence.  Putative  promoter  sequence  (Box  A)  and  transcription  start  site  (Box  B) 
are  indicated. 
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atcggtgaaggtgcaaaacttgtaagagatgttttcaaacttgcaaaagaaaaatccccc 
IGEGAKLVRDVFKLAKEKSP  242 

tgcatcatatttattgatgaaatcgatgcagttgcaagcaaaagaactgaatcattaacc 
CIIFIDEIDAVASKRTESLT  262 

ggtggagatagagaagttcaaagaactttaatgcagcttcttgcagaaatggatggtttt 
GGDREVQRTLMQLLAEMDGF  282 

gattcaagaggcgatgtaaaaatcattgctgcgacaaacagaccagacattttagatcct 
DSRGDVKIIAATNRPDILDP  302 

gcaatactcagacctggaagatttgatagaatcattgaaatttcaatgcctgatgaagat 
AILRPGRFDRIIEISMPDED  322 

ggaagactcgaaatcttaaaaatccacactgaaaaaatgaaccttaaaggtgtggactta 
GRLEILKIHTEKMNLKGVDL  342 

agagaagttgcaaaaattgcagaaaacatggtcggtgctgacttaaaagcagtatgtact 
REVAKIAENMVGADLKAVCT  362 

gaagctggtatgtttgcaataagagaagaaagagaattcatcaaaatggatgacttcaga 
EAGMFAIREEREFIKMDDFR  382 

gaagctatctcaaaaatcacaggtaagaaagaaaaatgcagttacgacatgccacaatta 
EAISKITGKKEKCSYDMPQL  402 

accgtaatgtacggataattaactactattttttcaacttatttttattttaaatttttt 
T     V     M     Y     G     *  407 

taagtgatgtttatgaagattggaattattggagcaggccttgggggcctattaagcggt 

gcaattttatcaaaagaacactcagttacaatttacgaaaaacttccatttgttggaggt 

agattacaaatattcctacaaaggattcaactgacaacaggagccttcacatgatcctca 

tggaactacgggtatttagcacaa  -3' 


FIG.  11.  Cont. 
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conservative,  and  the  deduced  amino  acid  sequences  of  the  two  ORFs  were  identical.  The 
ORF558  also  had  an  identical  5'  region,  with  conserved  in-frame  start  and  stop  codons  as 
well  as  two  putative  ribosomal  binding  sites.  Thus,  both  M  maripaludis  strains  may 
produce  an  identical  small  peptide  of  1 6  amino  acids  with  the  sequence 
Mi?X7?YIELFLSYLL/2i?  (basic  and  hydrophobic  residues  are  indicated  by  italics  and 
bold,  respectively).  This  putative  peptide  is  highly  basic  at  the  N-  and  C-termini,  with 
patches  of  hydrophobic  and  aromatic  residues  in  the  middle.  The  N-terminus  of  MjPAN 
is  also  highly  charged  with  alternating  basic  and  acidic  residues  but  very  few 
hydrophobic  or  aromatic  residues.  Whether  or  not  this  small  peptide  is  produced  in  the 
cell  and  what  purpose  it  might  serve  has  not  been  determined  at  this  time.  As  can  be  seen 
from  a  multiple  sequence  alignment  of  archaeal  PAN  proteins  (Fig.  12 A),  MjPAN  has  an 
N-terminal  extension  that  is  not  seen  in  any  other  archaeal  PAN  homolog.  This  included 
PAN  proteins  from  other  hyperthermophilic  organisms.  A  sequence  alignment  of  the  N- 
terminal  region  of  MjPAN  to  the  16  amino-acid  long  peptide  upstream  from  MmPAN 
reveals  that  the  N-terminal  region  of  MjPAN  has  some  identity  to  this  peptide  (Fig.  12B). 
Biochemical  results  from  this  study  (discussed  above)  indicate  that  the  N-terminus  of 
MjPAN  may  help  stabilize  the  protein  in  high  temperature.  Thus,  it  is  possible  that  the  N- 
terminus  of  MjPAN,  which  is  required  for  the  extreme  thermostability  of  this  protein  in 
M  jannaschii  is  dispensable  in  the  mesophilic  M.  maripaludis.  A  future  experiment  to 
test  this  hypothesis  would  involve  mutating  the  stop  codon  of  this  peptide  so  that 
MmPAN  is  produced  with  an  1 8  amino-acid  long  N-terminal  extension.  The 
thermostability  and  other  biochemical  properties  of  this  protein  could  then  be  compared 
to  the  wild-type  MmPAN. 
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MmPAN 

MjPAN 

AF1976 

MTH0728 

PH0201 

PAB2233 

APE2012 

AAG20171 

AAG19042 


1  MSYPDDYSTDIEKNKMDLKEFKEKTQIAE 

1  MVFEEFI STELKKEKKAFTEEFKEEKE  INDNSI^JkNDLLKEELQEKARIAE 

1  mgdseiqylleklkkleedyyl 

1  mennsqnvlkkiediJkkevpmlkeenskt: 

1  ms\^sgdevqfqgdyddyity 

1  MNii; 

1  Mn" 

1  MSRSPSLPDRPTLDVDPESTPAERLNALQDHYVDgVAVN 

1  


MNIgSGDEVQFHENYDDYITYjj 
MHJSSAGGSRSHRHNGGHSERDVEJ 


MmPAN 

35 

MjPAN 

57 

AF1976 

28 

MTH0728 

38 

PH0201 

27 

PAB2233 

27 

APE2012 

26 

AAG20171 

40 

AAG19042 

1 

S :  :  II  e  ss 


MmPAN 

MjPAN 

AF1976 

MTH0728 

PH0201 

PAB2233 

APE2012 

AAG20171 

AAG19042 


MnciPAN 

MjPAN 

AF1976 

MTH0728 

PH0201 

PAB2233 

APE2012 

AAG20171 

AAG19042 


Walker  A 


GPPGTGK' 
GPPGTGK' 
GPPGTGK' 
GPPGTGK' 

gppgHgk' 

GPPGTGK' 
GPPGTGK' 
GPPGTGK 
GPPGTGK 


FIG.  12.  A.  Multiple  sequence  alignment  of  Archaeal  PAN  homologs.  Abbreviations: 
AF1976,  A.fulgidus  PAN;  MTH0728,  M  thermoautotrophicum  PAN;  PH0201,  P. 
horikoshii  PAN;  PAB2233,  P.  abyssi  PAN;  APE2012,  A.  pernix  PAN;  AAG20171  and 
AAG 19042,  H.  volcanii  PAN  paralogs.  Wallcer  A  and  B  consensus  sequences  are  boxed. 
Conserved  cysteine  residue  in  C-terminus  is  indicated  by  arrows  above  and  below  the 
sequence.  B.  Pairwise  sequence  alignment  of  the  N-terminus  of  MjPAN  and  the  16- 
residue  ORF  in  the  upstream  region  of  MmPAN. 
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GD 
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MmPAN 

MjPAN 

AF1976 

MTH0728 

PH0201 

PAB2233 

APE2012 

AAG20171 

AAG19042 


394  KCSYDMPQLT^YG  

417  VKVKEPAHLDffljyR  

388  PIPDLKGVMfS-  

398  EEEYKQETGvSfG  

386  KLLQQITSHESgYG  

386  KLLQQITSHe5jYG  

386  KRGGGDPFIRAQQKSGDDTIATVI 

399  TESSGPRYPSY^Q  

357  PAAATDVSRTFA  


B. 

MmPAN  MRKRfllgJqjLHYfflLj^RIM 
MjPAN     — MVraElSWsHElfcgBEKK 


FIG.  12.  Cont. 
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Purification  of  MmPAN  from  Recombinant  E.  coli 
The  Mmpan  ORP  was  cloned  into  pET24b  and  pETlSb  expression  vectors  to 
generate  plasmids  pJAM727,  pJAM728,  and  pJAM730  (Table  4).  These  plasmids  were 
used  to  synthesize  MmPAN  in  recombinant  E.  coli  strains  supplemented  with  the  rare 
tRNA  genes.  The  MmPAN  proteins  that  were  produced  in  recombinant  E.  coli  cross- 
reacted  with  the  MjPAN  antibody  based  on  Western  analysis  (data  not  shown).  The  N- 
terminal  His6-tagged  MmPAN  protein  (N-Hise-MmPAN)  formed  inclusion  bodies  and 
aggregates,  with  a  small  amount  of  protein  in  the  soluble  fraction.  The  soluble  protein 
bound  to  a  Ni  -Sepharose  column,  but  had  no  detectable  ATPase  activity  and  formed 
aggregates  when  stored  at  4°C.  The  C-terminal  Hise-tagged  protein  (C-Hise-MmPAN) 
formed  fewer  aggregates  and  inclusion  bodies  than  N-Hise-MmPAN,  and  more  protein 
was  in  the  soluble  fraction.  The  C-Hise-MmPAN  fractions  which  eluted  after  purification 
using  a  Ni  -Sepharose  column  were  loaded  onto  a  Superose  6  size  exclusion  column. 
The  C-His6-MmPAN  eluted  from  the  column  in  a  broad  peak  from  >1.5-MDa  to  500- 
kDa,  but  very  little  ATPase  activity  was  detected  in  these  fractions.  For  purification  of 
the  unmodified  MmPAN  protein,  E.  coli  (pJAM727)  cell-free  extract  was  applied  to  a 
Superose  6  column.  While  some  MmPAN  protein  eluted  in  the  void  volume,  (>1.5-MDa) 
there  was  also  a  peak  of  MmPAN  at  600-kDa.  These  fractions  were  fiirther  purified  using 
a  MonoQ  column,  where  the  protein  eluted  at  500  mM  NaCl.  Based  on  non-denaturing 
native-gel  electrophoresis  (Fig.  13),  this  600-kDa  fraction  formed  a  distinct  complex 
which  migrated  similar  to  MjPAN.  However,  very  little  ATPase  activity  was  detected  for 
MmPAN,  and  the  protein  aggregated  when  incubated  at  50°  C  (Fig.  13). 
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Partial  Purification  and  Characterization  of  Two  NTPases  from  M.  maripaludis 
During  attempts  to  isolate  MmPAN  directly  from  M.  maripaludis,  two  complexes 
of  200-kDa  (NTPase  #1)  and  250-kDa  (NTPase  #2)  with  NEM-inhibitable  nucleotidase 
activity  were  identified  (Fig.  14)  and  partially  purified  (Fig.  15).  Neither  of  these  NTPases 
cross-reacted  with  the  anti-MjPAN  antibody  based  on  Western  analysis.  The  NTPase  #1 
formed  a  distinct  complex  based  on  native  gel  electrophoresis  (Fig.  15),  and  had  a  low  but 
detectable  ATPase  activity  of  1.58  nmol  Pj  released  ■  min"'  ■  mg  protein  '  (Table  10).  The 
NTPase  #1  hydrolyzed  CTP  at  69%  the  rate  of  ATP  hydrolysis,  and  the  ATPase  activity 
was  inhibited  by  equimolar  concentrations  of  ADP  to  43%  of  the  original  activity.  The 
CTPase  activity  was  only  slightly  inhibited  by  an  equimolar  concentration  of  CDP.  Pre- 
incubation with  NEM  resulted  in  inhibition  to  50%  of  the  original  activity.  The  NTPase 
#2  also  formed  a  distinct  complex  based  on  native  gel  electrophoresis  (Fig.  15)  and 
hydrolyzed  ATP  at  a  rate  of  3.14  nmol  Pi  released  •  min  '  •  mg  protein"'.  Similar  to 
NTPase  #1,  the  ATPase  activity  of  NTPase  #2  was  also  inhibited  by  equimolar 
concentrations  of  ADP  to  54%  of  the  original  activity  (Table  11).  The  NTPase  #2  also 
hydrolyzed  CTP  at  65%  the  rate  of  ATP  hydrolysis,  and  this  activity  was  not  inhibited  by 
an  equimolar  concentration  of  CDP.  However,  NTPase  #2  was  much  more  sensitive  to 
inhibition  by  5  mM  NEM  than  NTPase  #1,  retaining  only  7.5%  of  the  activity  measured 
in  the  absence  of  inhibitor  (Table  11). 
Purification  and  Characterization  of  a  20S  Proteasome  (Mm20S)  from  M.  maripaludis 
A  20S  proteasome  was  identified  (Fig.  14)  and  isolated  from  M.  maripaludis  cell 
free  extract  (Table  12).  The  molecular  masses  of  the  a  and  P  subunits  were  estimated  to 
be  28  and  24-kDa,  respectively,  based  on  SDS-PAGE  (Fig.  16A),  which  is  consistent 


FIG.  14.  Q  Sepharose  fractions  during  purification  of  a  20S  proteasome  and  two 
NTPase  proteins  from  M.maripaludis  ceil  lysate.  ATPase  specific  activity  (nmol  Pj 
per  min  mg"'  protein)  in  the  absence  (A)  and  presence  (A)  of  lOmM  NEM.. 
(•)  Relative  Intensity  measured  by  release  of  7-amino-4-methylcoumarin  from  the 
fluorogenic  substrate  LLVY-AMC.  NTPase  peaks  #1  and  #2  are  indicated.  Note  that 
NTPase  #1  is  not  as  inhibited  by  lOmM  NEM  as  NTPase  #2. 
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TABLE  10:  Biochemical  properties  of  NTPase  #1  from  M  maripaludis. 


Activity  %  Control 

(nmol  Pi  min"'mg'')  (no  inhibitor) 

ATPase  Activity 

ImM  ATP  only                                   1.58  100 

1  mM  ATP  +  1  mM  ADP                        0.69  43 

1  mM  ATP  +  5  mM  NEM                       0.81  50 

CTPase  Activity 

ImMCTPonly                                   1.09  100 

1  mM  CTP  +  1  mM  CDP                       0.84  77 


TABLE  1 1 :  Biochemical  properties  of  NTPase  #2  from  M  maripaludis. 


Activity  %  Control 

(nmol  Pi  min"'mg"')  (no  inhibitor) 

ATPase  Activity 

ImM  ATP  only                                   3.14  100 

1  mM  ATP  +  1  mM  ADP                        1 .70  54 

1  mM  ATP  +  5  mM  NEM                      0.24  7.5 

CTPase  Activity 

ImMCTPonly                                   2.07  100 

1  mM  CTP  +  1  mM  CDP                        2.00  97 


Ill 

with  the  genome  sequence  of  M  maripaludis  LL.  The  20S  proteasome  assembled  into  a 
700-kDa  complex  based  on  Superose  6  gel  filtration,  and  the  subunits  cross-reacted  with 
anti-Mta  and  anti-Mtp  antibodies  based  on  Western  analyses  (Fig.  16B).  In  addition, 
there  were  two  bands  detected  on  the  Western  blot  probed  with  anti-Mta  anitbody  (Fig. 
16B,  Lane  3).  There  are  two  possible  reasons  for  this.  Either  the  P  subunit  is  cross- 
reacting  with  the  anti-Mta  antibody,  or  the  a  subunit  is  being  cleaved  to  create  a  smaller 
protein.  Pairwise  sequence  aligrmient  of  the  p  subunit  of  Mm20S  to  Mta  indicates  that 
they  are  32%  identical  (50%  similar)  at  the  amino  acid  level.  Therefore,  cross-reaction  of 
the  P  subunit  with  the  anti-Mta  antibody  is  possible.  The  purified  20S  proteasome 
hydrolyzed  the  substrate  LLVY-AMC  at  an  optimum  of  85°C  (Fig.  17).  In  addition,  the 
protein  was  stable  when  incubated  at  80°C  for  15  minutes  before  addition  of  the  LLVY- 
AMC  substrate,  retaining  70%  of  the  activity  compared  to  a  control  sample  incubated  at 
25°C  (data  not  shown).  While  this  temperature  optimum,  which  is  43°C  higher  than  the 
growth  temperature  optimum  of  the  organism,  was  somewhat  surprising,  other  20S 
proteasomes  have  also  demonstrated  a  higher  optimum  for  peptide  hydrolysis  than  the 
growth  temperature  of  the  organism  (156,  286).  The  peptide-hydrolyzing  activity  was 
also  measured  using  various  synthetic  substrates  (Table  13).  Of  the  substrates  tested,  the 
proteasome  had  the  highest  rate  of  cleavage  carboxyl  to  the  aromatic  residue  tyrosine 
using  the  substrate  Suc-LLVY-AMC.  The  chymotrypsin-like  (CL)  activity  was  much 
lower  when  the  substrates  Suc-AAF-AMC  or  Suc-LY-AMC  were  used  (Table  13).  The 
proteasome  also  had  significant  PGPH  activity  using  the  substrate  Cbz-LLE-pNa  (Table 
13).  The  activities  observed  against  the  substrates  Suc-LLVY-AMC  and  Cbz-LLE-pNa 
were  higher  than  any  other  previously  characterized  20S  proteasome  except  for  the 
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TABLE  12.  Purification  of  a  20S  proteasome  from  M  maripaludis. 


Total 
Protein 
(mg) 

Specific 
Activity 

Total 
activity 
(nmol/min) 

Punfication 
(fold) 

% 

Recovery 

Cell  free  extract 

518 

0.70 

363 

1 

100 

QSepharose  26/10 

64.4 

0.74 

47.6 

1.1 

13 

PEG  concentration 

48.8 

1.01 

65.3 

1.45 

18 

Superose  6 

1.88 

8.66 

16.3  : 

12.4 

4.5 

Hydroxyapatite 

0.07 

92.65 

6.49 

132.4 

1.8 

Mono  Q  HR  5/5 

0.013 

258 

3.42 

368 

1.0 
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FIG.  17.  Effect  of  temperature  on  the  LLVY-AMC  hydrolysis  activity  of  the  20S 
proteasome  from  M  maripaludis.  Purified  20S  proteasome  (0. 1  \ig  per  assay)  was 
incubated  in  25  mM  Tris,  5  mM  DTT  at  pH  7.5  using  20  |aM  LLVY-AMC  as  a 
substrate.  Release  of  fluorogenic  7-amino-4-methylcoumarin  was  measured  after 
incubation  at  the  indicated  temperatures.  Data  presented  are  the  average  of  two 
experiments. 
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TABLE  13:  Multiple  peptide-hydrolyzing  activities  of  the 
20S  proteasome  from  M.  maripaludis. 


Activity* 

Substrate'' 

Specific  Activity*^ 

CL 

Suc-LLVY-AMC 

396.0 

Suc-AAF-AMC 

87.4 

oUC-lj  1  -/VlVll^ 

14  Q 

PGPH 

Cbz-LLE-pna 

243.8 

TL 

Benz-FVR-AMC 

31.3 

Boc-FSR-AMC 

UD 

SNAAP 

Suc-IA-AMC 

UD 

*CL,  chymostrypsin-like;  PGPH,  peptidyl-glutamyl  peptide  hydrolyzing;  TL,  trypsin-like; 
SNAAP,  small  neutral  amino  acid  preferring. 

''Sue,  succinyl;  AMC,  7-amido-4-methyl-coumarin;  Cbz,  carbobenzoxy;  Benz,  N- 
benzoyl;  Boc,  r-butyloxycarbonyl;  PNa,  P-naphthylamine. 

'^Specific  activity  was  measured  in  micromoles  of  fluorogenic  product  •  min"'  •  mg 
protein"'.  Peptide  hydrolysis  was  measured  at  60°C  using  150  \iM  of  the  indicated 
substrates  in  25  mM  Tris  buffer  at  pH  7.5  containing  5  mM  DTT. 
UD,  undetectable. 
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recombinant  M.  jannaschii  20S  proteasome  (287).  The  trypsin-like  (TL)  activity  of  the 
Mni20S  proteasome  using  the  substrate  Benz-FVR-AMC  was  low  but  virtually  identical 
to  the  activity  previously  observed  for  the  Mj20S  proteasome  (287).  The  ]V[m20S 
proteasome  had  no  detectable  activity  when  the  substrate  Boc-FSR-AMC  was  used.  This 
was  not  surprising,  as  low  TL  activities  have  also  been  observed  for  other  archaeal  and 
eubacterial  proteasomes  (6, 156,  299),  in  contrast  to  mammalian  20S  proteasomes  which 
generally  have  high  TL  activity  (reviewed  in  197).  Neither  the  Mm20S  proteasome 
(Table  13)  or  the  Mj20S  proteasome  (287)  had  any  detectable  small  neutral  amino  acid 
preferring  (SNAAP)  activity.  The  LLVY-AMC  hydrolyzing  activity  of  the  Mm20S 
proteasome  was  also  slightly  stimulated  by  NaCl,  KCl,  and  MgCb  salts  by  1 .6-,  1 .4-,  and 
1.3-fold,  respectively  (data  not  shown).  Electron  microscopy  of  the  700-kDa  Mm20S 
proteasome  reveals  a  four-ringed  structure  of  12  by  17  nm  with  a  central  channel  (Fig. 
18).  This  is  consistent  with  other  archaeal  20S  proteasomes  which  have  been  purified 
including  those  from  M.  thermophila  (156)  and  T.  acidophilum  (41). 

Synthesis  of  MmPAN  and  MjPAN  in  Recombinant  M.  maripaludis 
The  genes  expressing  His6-tagged  MjPAN  and  MmPAN  as  well  as  untagged 
constructs  were  placed  into  the  expression  shuttle  vector  pWLG40  and  the  proteins  were 
synthesized  in  recombinant  M.  maripaludis  (See  Table  4  for  plasmid  descriptions).  The 
strains  producing  MjPAN  or  MmPAN  were  grown  to  mid-log  phase  and  harvested.  The 
MjPAN  and  MmPAN  proteins  that  were  synthesized  in  the  recombinant  strains  were 
detected  by  Western  blot  of  the  cell  lysate  (Fig.  19).  When  either  MmPAN  or  MjPAN 
were  synthesized  as  untagged  constructs  or  with  C-terminal  His6  tags,  overproduction 
was  easily  detected  by  Western  blot  of  the  cell  lysate  (Fig.  19,  Lanes  3,5,6,7,  and  9).  In 
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FIG.  18.  Transmission  electron  microscopy  of  purified  20S  proteasomes  from  M. 
maripaludis.  Side  views  in  which  the  four-ringed  structure  is  visible,and  end-on  views 
(inset)  can  be  seen. 
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contrast,  there  was  no  obvious  overproduction  of  MmPAN  when  synthesized  as  an  N- 
terminal  His6-tagged  construct,  and  only  slight  overproduction  of  N-  Hise-tagged  MjPAN 
could  be  detected  (Fig.  19,  Lanes  4  and  8).  Interestingly,  there  was  no  detectable 
synthesis  of  truncated  MjPAN(Al-73)  even  when  the  ORF  on  the  plasmid  contained  the 
wild-type  internal  translation  start  site  (Fig.  19,  Lane  6).  This  indicates  that  M 
maripaludis  cell  proteins  are  not  initiating  synthesis  from  the  internal  translation  start  site 
when  grown  under  these  conditions. 

Cell  lysate  of  the  M.  maripaludis  strains  producing  PAN  proteins  was  analyzed 
for  ATPase  activity  (Table  14).  In  general,  the  strains  producing  MmPAN  had  little  if  any 
increase  in  ATPase  activity  compared  to  the  wild-type  strain.  However,  the  cell  lysate 
from  the  M  maripaludis  strains  producing  MjPAN  had  a  significant  increase  of  ATPase 
activity  compared  to  the  wild-type  strain  (Table  14).  When  assayed  at  50°C,  the  ATPase 
activity  of  wild-type  M.  maripaludis,  strains  producing  MmPAN,  and  strains  producing 
MjPAN  with  N-terminal  Hise-tags  showed  little  change  in  ATPase  activity.  In  contrast, 
the  ATPase  activity  of  the  cell  lysate  of  M  maripaludis  producing  untagged  MjPAN 
increased  to  4-fold  the  activity  of  the  wild-type  strain  when  incubated  at  50°C  (Table  14). 
These  results  are  consistent  with  recombinant  E.  coli  studies  which  also  showed  that 
addition  of  an  N-terminal  His6-tag  can  reduce  the  thermostability  of  MjPAN. 

Purification  of  His6-tagged  MmPAN  from  Recombinant  M  maripaludis 

The  C-terminally  His6-tagged  MmPAN  from  recombinant  M.  maripaludis 
(pJAM738)  bound  to  a  Cu^^-Sepharose  affinity  column  and  was  eluted  using  500  mM 
imidazole  buffer  (Fig.  20).  When  the  MmPAN  protein  fraction  was  applied  to  a  Superose 
6  column,  the  protein  eluted  in  a  very  broad  peak,  and  ATP  hydrolysis  could  not  be 


TABLE  14.  Comparison  of  ATPase  activities  of  cell  lysate  of 
recombinant  M  maripaludis  strains. 


Plasmid 

Protein 
Overproduced 

%  Activity 
37°C 

a 

50°C 

None 

None 

lOO'' 

100' 

pWLG40  +  lacZ 

P-galactosidase 

103 

103 

pJAM738 

C-His6-MmPAN 

103 

94 

pJAM739 

N-His6-MmPAN 

119 

146 

pJAM737 

MmPAN 

126 

145 

pJAM741 

C-His6-MjPAN 

124 

183 

pJAM742 

C-His6-MjPAN 

240 

32S 

pJAM743 

N-His6-MjPAN 

130 

U4 

pJAM740 

MjPAN 

162 

416 

^  %Activity  is  the  ATPase  activity  compared  to  cell  lysate  from  wild-type  M 
maripaludis  without  any  plasmid.  Cell  lysate  was  assayed  in  20  mM  Tris  buffer 
at  pH  7.5  containing  100  mM  NaCl,  10  mM  MgCb ,  and  1  mM  ATP. 
''100%  Activity,  9.7  nmol  Pj  released  •  min"'  •  mg  protein'  at  37°C. 
''100%  Activity,  8.5  nmol  Pj  released  •  min'  •  mg  protein"'  at  50°C. 
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detected.  These  results  were  similar  to  those  obtained  during  purification  of  the  C- 
terminal  Hise-tagged  MmPAN  from  recombinant  E.  coli.  Taken  together,  these  results 
suggest  that  the  C-terminal  region  may  be  important  in  formation  of  the  MmPAN 
complex,  or  that  the  C-terminal  Hise-tag  may  disrupt  complex  formation. 


SUMMARY  AND  CONCLUSIONS 


The  results  of  this  study  provide  evidence  that  PAN  and  20S  proteasome  proteins 
are  produced  in  both  of  the  methanogenic  archaea  M.  jannaschii  and  M  maripaludis. 
This  study  also  examined  the  biochemical  and  physical  properties  of  the  M.  jannaschii 
PAN  protein.  Structural  characterization  of  MjPAN  by  electron  microscopy,  gel 
filtration,  and  non-denaturing  native  gel  electrophoresis  showed  that  the  protein  forms  a 
550  kDa  complex  of  approximately  12  subunits  which  is  stable  in  the  absence  of  ATP.  In 
addition,  partial  purification  of  MjPAN  from  M  jannaschii  reveals  that  it  also  associates 
into  a  550  kDa  complex  with  similar  biochemical  properties  as  the  recombinant  protein. 
Separate  biochemical  characterization  of  the  full-length  MjPAN  protein  and  the  N- 
termmally  truncated  MjPAN(Al-73)  protein  revealed  that  the  highly  charged  N-terminus 
of  MjPAN  modulates  the  rate  of  ATP  and  CTP  hydrolysis  and  provides  nucleotide 
specificity  to  the  active  site.  In  addition,  the  N-terminus  appeared  to  influence  the 
sensitivity  of  MjPAN  to  inhibitors  such  as  NEM  and  PCMS  as  well  as  affect  product 
inhibition  by  ADP  or  CDP.  Secondary  structure  analysis  suggests  that  an  a-helical, 
putative  coiled-coil  region  in  the  N-terminus  of  MjPAN  is  absent  in  the  truncated  protein 
and,  therefore,  may  play  an  important  role  in  stabilizing  MjPAN  at  extreme  temperatures. 
Similar  to  other  proteins  which  are  stable  at  extreme  temperature  and/or  salt  conditions,  it 
is  possible  that  ion-pairs,  hydrophobic  clusters,  or  other  structural  elements  are  required 
for  the  extreme  thermostability  of  MjPAN  and  that  these  are  disrupted  by  removal  of  the 
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N-terminal  peptide.  Taken  together,  the  results  of  this  study  suggest  that  the  full-length 
MjPAN  protein,  detected  in  M  jannaschii  using  Western  blot  analysis,  is  likely  the  native 
form  which  functions  at  the  extreme  growth  temperature  of  this  organism. 

This  study  also  demonstrated  that  MjPAN  stimulates  the  proteolytic  activity  of 
archaeal  20S  proteasomes  in  an  ATP-  or  CTP-dependent  maimer.  In  addition,  electron 
micrographs  suggested  that  MjPAN  and  the  20S  proteasome  associate  in  vitro  as  a 
complex  of  the  20S  proteolytic  core  capped  either  singly  or  doubly  with  the  MjPAN 
protein  complex.  This  complex  resembled  the  proteasome  Base  subcomplex  recently 
described  for  yeast  (72). 

The  mesophilic  M.  maripaludis  is  a  methanoarchaeon  for  which  genetic  methods 
and  high-level  culture  techniques  have  been  developed.  Therefore,  this  organism  was 
chosen  as  an  attractive  model  in  which  to  continue  study  of  ATP-dependent  proteolysis  in 
the  Archaea.  A  20S  proteasome  was  purified  from  M.  maripaludis  which  had  a 
surprisingly  high  temperature  optimum  of  80°C  for  hydrolysis  of  peptide  substrates, 
compared  to  its  growth  temperature  optimum  of  37°C.  Two  NTPase  complexes  were  also 
partially  purified  from  M.  maripaludis.  Both  had  NEM-inhibitable  ATPase  and  CTPase 
activity,  characteristic  of  AAA-ATPases  including  MjPAN.  However,  neither  NTPase 
cross-reacted  with  the  MjPAN  antibody.  Therefore,  a  M.  maripaludis  genomic  library 
was  constructed,  and  an  ORP  with  70%  identity  to  the  M.jannaschii  pan  gene  was 
isolated.  The  Mmpan  ORP  was  cloned  into  expression  vectors,  and  MmPAN  was 
purified  from  recombinant  E.  coli.  The  purified  MmPAN  protein  formed  a  dodecameric 
complex,  but  had  no  ATPase  or  CTPase  activity  and  was  imstable  at  50°C.  This 
suggested  that  the  recombinant  MmPAN  may  not  be  folding  properly  in  the  cytosol  of 


125 


recombinant  E.coli.  Thus,  to  further  characterize  PAN,  both  MmPAN  and  MjPAN  were 
over-produced  in  recombinant  M  maripaludis  as  C-  and  N-terminally  HiSg-tagged 
proteins  as  well  as  untagged  proteins.  The  purified  C-terminally  Hise-tagged  MmPAN 
had  biochemical  properties  similar  to  the  MmPAN  protein  produced  in  recombinant 
E.coli.  Sequence  alignment  of  MjPAN  and  MmPAN  revealed  that  MmPAN  lacked  the 
highly  charged  amino-terminus  of  MjPAN.  This  may  explain  the  lack  of  thermostability 
of  recombinant  MmPAN.  The  reason  MmPAN  lacks  ATPase  activity  remains  to  be 
determined. 

Current  and  future  studies  of  this  project  are  also  focused  on  determining  proteins 
which  may  interact  with  PAN  and  the  20S  proteasome  in  vivo.  Using  the  His6-tagged 
constructs  of  PAN  from  this  study  may  identify  proteins  which  interact  with  the 
proteasome  in  vivo.  This  is  based  on  a  recent  study  of  the  yeast  proteasome  which  used 
affinity-tagged  constructs  to  rapidly  purify  intact  26S  proteasomes  and  RP  (19S  cap) 
complexes  from  the  cytosol  (267).  Mass  spectrometry  techniques  were  then  used  to 
identify  24  previously  undiscovered  proteins  which  interacted  with  the  26S  proteasome. 

The  recently  completed  genome  sequence  of  M.  maripaludis,  combined  with  the 
genetic  techniques  which  have  been  developed  for  use  in  this  methanogenic 
microorganism,  should  also  aid  future  studies  of  the  proteasome  in  Archaea.  Further 
characterization  of  MjPAN  and  MmPAN  produced  in  recombinant  M.  maripaludis 
should  reveal  more  about  the  similarities  and  differences  between  these  two  proteins,  one 
from  a  mesophile  and  the  other  from  a  hyper-thermophile. 
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